Opendatabay APP

YouTube Educational Content Feedback

Social Media and Networking

Tags and Keywords

Business, Education, Tabular, Beginner, Text, NLP

Free

About

This dataset is a collection of comments from popular YouTube channels that teach data science and programming. The comments were extracted from selected playlists, and the top videos within them, on channels such as CampusX-official, freeCodeCamp, and sentdex. The data is suited to analysing audience engagement, measuring sentiment, and identifying trending topics in the data science education community on YouTube. Researchers, educators, and content creators can use it to study how learners interact with online teaching material.

Columns

  • snippet_topLevelComment_id: A unique identifier for the top-level comment.
  • comment_ID: Another identifier for the specific comment.
  • snippet_topLevelComment_snippet_videoId: The identifier of the YouTube video associated with the comment.
  • Video_ID of the comment: The video ID to which the comment belongs.
  • snippet_topLevelComment_snippet_textDisplay: The text of the comment as it appears on the YouTube webpage.
  • snippet_topLevelComment_snippet_textOriginal: The original, unformatted text of the comment.
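  • snippet_topLevelComment_snippet_viewerRating: The rating the viewer gave to the comment (in the YouTube Data API, this is the rating from the authenticated user making the request, not an aggregate across viewers).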
  • snippet_topLevelComment_snippet_likeCount: The total number of likes received by the comment.
  • snippet_topLevelComment_snippet_publishedAt: The date and time when the comment was originally published.
  • snippet_topLevelComment_snippet_updatedAt: The date and time when the comment was last updated.
  • snippet_canReply: A boolean indicator specifying whether replies are open for the comment.
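
As a minimal sketch of how these columns might be read, the example below parses a small in-memory sample with Python's standard csv module. The sample rows, the ISO 8601 timestamp format, and the True/empty encoding of snippet_canReply are assumptions for illustration; the listing does not state the exact value formats or the file name inside the ZIP.

```python
import csv
import io
from datetime import datetime

# Hypothetical two-row sample using column names from the list above;
# the real value formats are assumptions.
SAMPLE_CSV = """snippet_topLevelComment_id,snippet_topLevelComment_snippet_videoId,snippet_topLevelComment_snippet_textOriginal,snippet_topLevelComment_snippet_likeCount,snippet_topLevelComment_snippet_publishedAt,snippet_canReply
c1,v1,Great explanation of gradient descent!,12,2018-03-08T10:15:00Z,True
c2,v1,Please cover transformers next,0,2022-09-25T18:40:00Z,
"""

PREFIX = "snippet_topLevelComment_snippet_"


def parse_timestamp(value):
    """Parse an ISO 8601 timestamp, accepting a trailing 'Z' for UTC."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def load_comments(handle):
    """Read comment rows and convert the numeric, date, and boolean fields."""
    rows = []
    for raw in csv.DictReader(handle):
        can_reply = raw["snippet_canReply"].strip().lower()
        rows.append({
            "comment_id": raw["snippet_topLevelComment_id"],
            "video_id": raw[PREFIX + "videoId"],
            "text": raw[PREFIX + "textOriginal"],
            "likes": int(raw[PREFIX + "likeCount"] or 0),
            "published": parse_timestamp(raw[PREFIX + "publishedAt"]),
            # Empty cells become None, mirroring "no specified reply status".
            "can_reply": None if can_reply == "" else can_reply == "true",
        })
    return rows


comments = load_comments(io.StringIO(SAMPLE_CSV))
print(len(comments), comments[0]["likes"], comments[1]["can_reply"])
```

To read the downloaded file instead, pass an open file handle (for example one opened with newline="" and encoding="utf-8") to load_comments in place of the StringIO object.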

Distribution

The data is tabular and suited to CSV. It contains approximately 9,014 comment records, organised by video, with up to 100 comments selected per video and up to 50 videos per channel playlist. One numeric field, which the listing does not name, is heavily skewed towards low values: 8,949 comments fall in the 0.00-231.60 range, and a single outlier falls in the 2084.40-2316.00 range. For reply availability, 8,945 comments allow replies, while 69 have no specified reply status.
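
The ranges quoted above match what ten equal-width bins over 0 to 2316 would produce (each bin 231.6 wide), which suggests, but does not confirm, a ten-bin histogram of a single numeric column. The sketch below shows that kind of binning. The like_counts values are hypothetical, and applying it to the like-count column is an assumption, since the listing does not say which field was summarised.

```python
from collections import Counter


def equal_width_bins(values, n_bins=10):
    """Count values in n_bins equal-width bins spanning min(values)..max(values)."""
    low, high = min(values), max(values)
    width = (high - low) / n_bins or 1  # avoid zero width when all values match
    counts = Counter()
    for v in values:
        # The maximum value is placed in the last bin rather than a new one.
        index = min(int((v - low) / width), n_bins - 1)
        counts[index] += 1
    return [(low + i * width, low + (i + 1) * width, counts[i]) for i in range(n_bins)]


# Hypothetical values: most are small, one is a large outlier.
like_counts = [0, 1, 3, 0, 12, 40, 230, 2316]
for start, end, count in equal_width_bins(like_counts):
    print(f"{start:8.2f} - {end:8.2f}: {count}")
```

With a distribution this skewed, equal-width bins put nearly everything in the first bin, so a log scale or quantile-based bins may be more informative in practice.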

Usage

This dataset is ideal for:
  • Natural Language Processing (NLP) tasks, such as sentiment analysis, topic modelling, and keyword extraction from user comments.
  • Educational research, to study engagement patterns and feedback mechanisms in online data science courses.
  • Content strategy development for YouTube creators in the data science domain, helping them understand audience interests and pain points.
  • Market research to identify trends and popular concepts within the data science and programming community.
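
As a rough illustration of the keyword-extraction use case, the sketch below counts frequent terms across comment texts using only the standard library. The stop-word list and the sample comments are hypothetical, and simple frequency counting stands in for the fuller NLP methods a real project would use.

```python
import re
from collections import Counter

# A tiny illustrative stop-word list; a real project would use a fuller one.
STOP_WORDS = {"the", "a", "an", "and", "of", "to", "is", "this", "for", "in", "on", "it", "you", "i"}


def top_keywords(texts, n=3):
    """Return the n most frequent non-stop-word tokens across all texts."""
    counts = Counter()
    for text in texts:
        tokens = re.findall(r"[a-z']+", text.lower())
        # Skip stop words and very short tokens.
        counts.update(t for t in tokens if t not in STOP_WORDS and len(t) > 2)
    return counts.most_common(n)


# Hypothetical comments, not taken from the dataset.
sample_comments = [
    "This pandas tutorial is the best pandas course on YouTube",
    "Great tutorial, please cover pandas groupby next",
    "The groupby section finally made sense to me",
]
print(top_keywords(sample_comments))
```

The same function could be run on the snippet_topLevelComment_snippet_textOriginal column, grouped by video ID, to compare which topics dominate each video's comments.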

Coverage

The dataset covers comments from YouTube data science channels globally, spanning a time range from 8th March 2018 to 25th September 2022. The included channels are prominent in the data science and programming education sphere, ensuring a focus on the demographics interested in these subjects.
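
One way to check the stated time range against the data is to parse the publication timestamps and report the earliest, the latest, and a per-year count. The sketch below assumes ISO 8601 timestamps and treats the coverage bounds as whole UTC days; both are assumptions, and the sample timestamps are hypothetical.

```python
from collections import Counter
from datetime import datetime, timezone

# Coverage bounds quoted in the listing, treated as whole UTC days (an assumption).
START = datetime(2018, 3, 8, tzinfo=timezone.utc)
END = datetime(2022, 9, 25, 23, 59, 59, tzinfo=timezone.utc)


def parse_timestamp(value):
    """Parse an ISO 8601 timestamp, accepting a trailing 'Z' for UTC."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def coverage_report(timestamps):
    """Return earliest, latest, comments per year, and any out-of-range values."""
    parsed = [parse_timestamp(t) for t in timestamps]
    outside = [p for p in parsed if not START <= p <= END]
    per_year = Counter(p.year for p in parsed)
    return min(parsed), max(parsed), dict(sorted(per_year.items())), outside


# Hypothetical publishedAt values, not taken from the dataset.
sample_timestamps = [
    "2018-03-08T10:15:00Z",
    "2020-06-01T08:00:00Z",
    "2020-11-30T21:05:00Z",
    "2022-09-25T18:40:00Z",
]
earliest, latest, per_year, outside = coverage_report(sample_timestamps)
print(earliest.date(), latest.date(), per_year, len(outside))
```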

License

CC0

Who Can Use It

This dataset is suitable for:
  • Data scientists and NLP engineers for text analysis projects.
  • Academic researchers studying online communities, education technology, or digital content engagement.
  • Educators and e-learning platforms to understand student interaction and content effectiveness.
  • Beginners in data science looking for a real-world text dataset for practice.

Dataset Name Suggestions

  • YouTube Data Science Comment Activity
  • Online Learning Engagement Data
  • Data Science Channel Comments Archive
  • YouTube Educational Content Feedback

Attributes

Listing Stats

  • Views: 0
  • Downloads: 0
  • Listed: 26/06/2025
  • Region: Global
  • Universal Data Quality Score (UDQS): 5 / 5
  • Version: 1.0
