YouTube Video Performance and Influencers

YouTube Video Performance and Influencers Dataset on Opendatabay data marketplace

Free

About

The data consists of YouTube activity metrics collected from the official channels of the top six American late-night talk shows. These shows, a staple of television culture, have achieved significant global digital reach: some channels have over 20 million subscribers. The information is structured on a per-show basis and includes video titles alongside quantitative engagement statistics such as views, likes, and comments. The primary motivation for compiling this data was to study how elements such as the video title or the appearance of a specific celebrity influence a video's engagement rate.

Columns

The data file for the Conan show includes 10 relevant columns out of 13 available fields. Statistics are based on approximately 8,725 records:
  • publishedAtSQL: The date and time the video was published. This field is 100% valid.
  • videoTitle: The title assigned to the video, containing 8,710 unique entries. This field is 100% valid.
  • videoDescription: The text description accompanying the video, with 8,702 unique descriptions.
  • videoCategoryId: The numeric category code assigned by YouTube, with a mean value of 23.2.
  • videoCategoryLabel: The descriptive label for the category, most frequently defined as Comedy (75%) or Entertainment (23%).
  • durationSec: The length of the video in seconds, with a mean duration of 225 seconds and a maximum length of 3,596 seconds.
  • definition: Indicates the video quality, which is predominantly High Definition (HD, 99%).
  • caption: A boolean field indicating whether captions are provided. Captions are available for approximately 3% of the videos.
  • licensedContent: A field indicating if the content is licensed.
  • viewCount: The total number of views the video has accrued, with a mean of 786,000 and a maximum recorded view count of 108 million.
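The summary figures quoted above (record count, unique titles, mean and maximum duration and views, caption share) can be recomputed from a per-show file. The following is a minimal sketch using only the Python standard library. It assumes the CSV header uses the column names listed above and that the caption field is serialised as the strings "true" and "false"; both are assumptions, not confirmed by the listing. The function name `summarize_videos` and the sample rows are hypothetical and for illustration only.

```python
import csv
import io
from statistics import mean


def summarize_videos(csv_file):
    """Compute basic summary statistics from an open per-show CSV file."""
    rows = list(csv.DictReader(csv_file))
    # Skip empty cells so that missing values do not break int().
    durations = [int(r["durationSec"]) for r in rows if r["durationSec"]]
    views = [int(r["viewCount"]) for r in rows if r["viewCount"]]
    # Assumption: captions are stored as the text "true" / "false".
    captioned = sum(1 for r in rows if r["caption"].strip().lower() == "true")
    return {
        "records": len(rows),
        "unique_titles": len({r["videoTitle"] for r in rows}),
        "mean_duration_sec": mean(durations),
        "max_duration_sec": max(durations),
        "mean_views": mean(views),
        "max_views": max(views),
        "caption_share": captioned / len(rows),
    }


# Made-up sample rows, not taken from the dataset.
SAMPLE = """videoTitle,durationSec,caption,viewCount
Clip A,120,false,1000
Clip B,300,true,3000
Clip A,180,false,2000
"""

summary = summarize_videos(io.StringIO(SAMPLE))
print(summary)

# With the real file, something like the following should work:
# with open("Conan.csv", newline="", encoding="utf-8") as f:
#     print(summarize_videos(f))
```

The sketch raises an error on a file with no usable rows, which is acceptable for a quick check but would need handling in a pipeline.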

Distribution

The material is organized into one file per talk show channel. The example file, Conan.csv, is 5.99 MB in size and contains 10 of the 13 available columns across approximately 8,725 records. Core time-based and identifying fields are highly valid, generally maintaining 100% completeness. The statistics are a snapshot taken on 13th June 2020, and no further updates are expected.

Usage

This resource is ideal for machine learning and analytical projects focused on predicting video engagement. It can be used to model how descriptive factors, such as video titles or category labels, influence quantitative outcomes like views, likes, dislikes, and comments. Analysts can test hypotheses regarding the impact of celebrity guests on viewership and engagement rates.
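As a first step toward the celebrity-guest hypothesis, one simple check is to compare view counts for videos whose titles mention a given name against all other videos. The sketch below is one possible approach, not the method used by the dataset's author. It uses the median rather than the mean because view counts are heavily skewed (a mean of 786,000 against a maximum of 108 million). The function name `median_views_by_keyword`, the keyword, and the sample rows are hypothetical; the column names `videoTitle` and `viewCount` are assumed to match the CSV header.

```python
import csv
import io
from statistics import median


def median_views_by_keyword(csv_file, keyword):
    """Compare median views for titles that contain `keyword` vs. the rest."""
    with_kw, without_kw = [], []
    needle = keyword.lower()
    for row in csv.DictReader(csv_file):
        if not row["viewCount"]:
            continue  # skip rows with a missing view count
        views = int(row["viewCount"])
        if needle in row["videoTitle"].lower():
            with_kw.append(views)
        else:
            without_kw.append(views)
    return {
        "with_keyword": median(with_kw) if with_kw else None,
        "without_keyword": median(without_kw) if without_kw else None,
        "n_with": len(with_kw),
        "n_without": len(without_kw),
    }


# Made-up sample rows, not taken from the dataset.
SAMPLE = """videoTitle,viewCount
Guest X Talks About Cooking,5000
Monologue Highlights,1000
Guest X Plays A Game,7000
Audience Questions,
Behind The Scenes,3000
"""

result = median_views_by_keyword(io.StringIO(SAMPLE), "guest x")
print(result)
```

A difference in medians is only descriptive. Upload date, video length, and category can all confound the comparison, so a predictive model would need to account for them.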

Coverage

The scope covers YouTube video data gathered from the top six American late-night talk show channels. The statistics reflect the performance of these videos up until 13th June 2020. Geographic coverage is effectively global, reflecting the digital reach of these American cultural institutions.

License

CC0: Public Domain

Who Can Use It

The dataset is intended for users interested in social networks or arts and entertainment, and for data science professionals conducting predictive modelling on digital media performance. The material has the maximum usability rating of 10.00.

Dataset Name Suggestions

  • American Late Night Talk Show YouTube Performance
  • Global Talk Show Engagement Metrics
  • YouTube Video Performance and Influencers

Attributes

Listing Stats

  • Views: 8
  • Downloads: 2
  • Listed: 17/12/2025
  • Region: Global
  • Universal Data Quality Score (UDQS): 5 / 5
  • Version: 1.0
