Opendatabay APP

YouTube Video Popularity & Comments Dataset

Reviews & Ratings

Tags and Keywords

Tabular

Data

Nlp

Popular

Trusted By
Trusted by company1Trusted by company2Trusted by company3
YouTube Video Popularity & Comments Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset is designed for analysing the relationship between the popularity of a video and the most relevant or liked comments it receives. It includes two distinct files: one containing basic video statistics such as title, likes, views, keywords, and comment counts, and another detailing the top ten most relevant comments for each video, along with their sentiment and associated likes.

Columns

videos-stats.csv:
  • Title: The title of the video.
  • Video ID: A unique identifier for the video.
  • Published At: The date the video was published, formatted as YYYY-MM-DD.
  • Keyword: The keyword linked to the video.
  • Likes: The number of likes the video has received. A value of -1 indicates that likes are not publicly visible.
  • Comments: The total number of comments on the video. A value of -1 indicates that the video creator has disabled comments.
  • Views: The total number of views the video has accumulated.
comments.csv:
  • index: An ordinal index for the comment record.
  • Video ID: The identifier for the associated video.
  • Comment: The full text of the comment.
  • Likes: The number of likes the comment has received.
  • Sentiment: The sentiment expressed in the comment. A value of 0 signifies a negative sentiment, while 1 represents a neutral sentiment, and 2 indicates a positive sentiment.

Distribution

The dataset is provided in CSV format and comprises two files: videos-stats.csv and comments.csv. The comments.csv file contains information regarding comments for each video identifier, with typically 10 comments associated per video ID. The comments.csv file holds approximately 18,409 records. The videos-stats.csv file is expected to contain metadata for 17,995 unique video identifiers.

Usage

This dataset is ideal for various analytical and machine learning applications, including:
  • Sentiment analysis of comments.
  • Text generation based on comment content.
  • Predicting video likes from comment information.
  • Analysing video popularity by keywords.
  • Forecasting video views using comment details and video statistics.
  • In-depth exploratory data analysis of video and comment data.

Coverage

The dataset has a global reach, with no specific geographic limitations mentioned. The Published At column provides date information for videos, although a specific time range for the entire dataset is not provided in the sources. No specific demographic scope is detailed within the provided information.

License

CCO

Who Can Use It

This dataset is suitable for data analysts, researchers, and developers focusing on social media analytics, natural language processing (NLP), and understanding content popularity. It can be valuable for projects involving comment sentiment analysis, video popularity prediction, and general exploratory data analysis of video platforms.

Dataset Name Suggestions

  • Youtube Statistics
  • Video Comment Sentiment Analysis
  • YouTube Video Popularity & Comments
  • Social Media Video Insights
  • Video Engagement Metrics

Attributes

Original Data Source: Youtube Statistics

Listing Stats

VIEWS

1

DOWNLOADS

1

LISTED

05/06/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in CSV Format