YouTube Video Popularity & Comments Dataset
Free
About
This dataset is designed for analysing the relationship between the popularity of a video and the most relevant or liked comments it receives. It includes two distinct files: one containing basic video statistics such as title, likes, views, keywords, and comment counts, and another detailing the top ten most relevant comments for each video, along with their sentiment and associated likes.
Columns
videos-stats.csv:
- Title: The title of the video.
- Video ID: A unique identifier for the video.
- Published At: The date the video was published, formatted as YYYY-MM-DD.
- Keyword: The keyword linked to the video.
- Likes: The number of likes the video has received. A value of -1 indicates that likes are not publicly visible.
- Comments: The total number of comments on the video. A value of -1 indicates that the video creator has disabled comments.
- Views: The total number of views the video has accumulated.
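The -1 sentinels in Likes and Comments are easy to mistake for real counts, so it can help to convert them to missing values on load. The following is a minimal sketch using only the Python standard library. It assumes the CSV headers match the column names listed above and that counts may be written either as integers or as floats such as 3407.0 (an assumption, not confirmed here). The sample rows are invented for illustration, and parse_count and load_videos are hypothetical helper names.

```python
import csv
import io
from datetime import date

# Invented sample rows for illustration; the real file is videos-stats.csv.
SAMPLE = """Title,Video ID,Published At,Keyword,Likes,Comments,Views
Example video A,abc123,2022-08-23,tech,3407.0,672.0,135612.0
Example video B,def456,2022-08-24,tech,-1.0,-1.0,1564007.0
"""


def parse_count(raw):
    """Return the count as an int, or None when it is the -1 sentinel."""
    value = int(float(raw))
    return None if value == -1 else value


def load_videos(handle):
    """Read video rows from an open CSV handle into a list of dicts."""
    videos = []
    for row in csv.DictReader(handle):
        videos.append({
            "title": row["Title"],
            "video_id": row["Video ID"],
            # Published At is documented as YYYY-MM-DD.
            "published_at": date.fromisoformat(row["Published At"]),
            "keyword": row["Keyword"],
            "likes": parse_count(row["Likes"]),        # None: likes hidden
            "comments": parse_count(row["Comments"]),  # None: comments disabled
            "views": int(float(row["Views"])),
        })
    return videos


videos = load_videos(io.StringIO(SAMPLE))
print(videos[1]["likes"], videos[1]["comments"])
```

To read the actual file, pass an open file object (for example, open("videos-stats.csv", newline="", encoding="utf-8")) instead of the in-memory sample. Rows with empty cells are not handled in this sketch.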
comments.csv:
- index: An ordinal index for the comment record.
- Video ID: The identifier for the associated video.
- Comment: The full text of the comment.
- Likes: The number of likes the comment has received.
- Sentiment: The sentiment expressed in the comment, coded as 0 (negative), 1 (neutral), or 2 (positive).
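The numeric Sentiment codes can be mapped to labels and tallied per video. The sketch below is illustrative only: it assumes the headers match the column names listed above (including a first column literally named index), the sample rows are invented, and SENTIMENT_LABELS and sentiment_counts are hypothetical names.

```python
import csv
import io
from collections import Counter, defaultdict

# Mapping taken from the Sentiment column description.
SENTIMENT_LABELS = {0: "negative", 1: "neutral", 2: "positive"}

# Invented sample rows for illustration; the real file is comments.csv.
SAMPLE = """index,Video ID,Comment,Likes,Sentiment
0,abc123,"Great explanation, thanks",95,2
1,abc123,Audio is too quiet,3,0
2,def456,Uploaded on Tuesday,0,1
"""


def sentiment_counts(handle):
    """Count sentiment labels per Video ID from an open CSV handle."""
    counts = defaultdict(Counter)
    for row in csv.DictReader(handle):
        # int(float(...)) tolerates codes stored as "2" or "2.0".
        label = SENTIMENT_LABELS[int(float(row["Sentiment"]))]
        counts[row["Video ID"]][label] += 1
    return counts


counts = sentiment_counts(io.StringIO(SAMPLE))
print(dict(counts["abc123"]))
```

Because the result is keyed by Video ID, it can be matched against rows of videos-stats.csv through the shared identifier.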
Distribution
The dataset is provided in CSV format and comprises two files: videos-stats.csv and comments.csv. The comments.csv file contains information regarding comments for each video identifier, with typically 10 comments associated per video ID. The comments.csv file holds approximately 18,409 records. The videos-stats.csv file is expected to contain metadata for 17,995 unique video identifiers.
Usage
This dataset is ideal for various analytical and machine learning applications, including:
- Sentiment analysis of comments.
- Text generation based on comment content.
- Predicting video likes from comment information.
- Analysing video popularity by keywords.
- Forecasting video views using comment details and video statistics.
- In-depth exploratory data analysis of video and comment data.
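As one example of the keyword analysis listed above, the sketch below averages views and likes-per-view for each keyword, skipping videos whose likes are hidden. It is a rough illustration rather than a recommended methodology: the records are invented, they are assumed to have already been parsed so that a Likes value of -1 has become None, and popularity_by_keyword is a hypothetical name.

```python
from collections import defaultdict
from statistics import mean

# Invented records shaped like parsed rows of videos-stats.csv;
# likes is None where the source value was -1 (likes hidden).
videos = [
    {"keyword": "tech", "views": 1000, "likes": 100},
    {"keyword": "tech", "views": 3000, "likes": None},
    {"keyword": "music", "views": 500, "likes": 25},
]


def popularity_by_keyword(records):
    """Return {keyword: (mean views, mean likes-per-view or None)}."""
    views = defaultdict(list)
    like_rates = defaultdict(list)
    for rec in records:
        views[rec["keyword"]].append(rec["views"])
        # Only videos with visible likes and non-zero views contribute a rate.
        if rec["likes"] is not None and rec["views"] > 0:
            like_rates[rec["keyword"]].append(rec["likes"] / rec["views"])
    return {
        kw: (mean(v), mean(like_rates[kw]) if like_rates[kw] else None)
        for kw, v in views.items()
    }


summary = popularity_by_keyword(videos)
for keyword, (avg_views, avg_like_rate) in summary.items():
    print(keyword, avg_views, avg_like_rate)
```

Excluding hidden-like videos from the rate, rather than treating -1 as a count, avoids dragging the average below zero.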
Coverage
The dataset has a global reach, with no specific geographic limitations mentioned. The Published At column provides date information for videos, although a specific time range for the entire dataset is not provided in the sources. No specific demographic scope is detailed within the provided information.
License
CC0
Who Can Use It
This dataset is suitable for data analysts, researchers, and developers focusing on social media analytics, natural language processing (NLP), and understanding content popularity. It can be valuable for projects involving comment sentiment analysis, video popularity prediction, and general exploratory data analysis of video platforms.
Dataset Name Suggestions
- Youtube Statistics
- Video Comment Sentiment Analysis
- YouTube Video Popularity & Comments
- Social Media Video Insights
- Video Engagement Metrics
Attributes
Original Data Source: Youtube Statistics