MrBeast YouTube Comments Dataset
Entertainment & Media Consumption
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset encapsulates the vibrant community interaction surrounding MrBeast's immensely popular YouTube video, "$456,000 Squid Game In Real Life!" By meticulously compiling 100,000 comments, this collection offers a unique window into the public discourse, engagement, and sentiments of one of YouTube's most significant viral phenomena. It is a valuable resource for understanding audience response to viral content and public interaction. Recognising the importance of privacy and ethical data handling, user names have been anonymised using SHA-256 encryption, where each username is first salted with a unique sequence.
Columns
- Comment: The full text of the user's comment, reflecting their thoughts, reactions, or interactions with the content.
- Anonymised Author: The SHA-256 hashed representation of the user's name, ensuring anonymity while maintaining a unique identifier for each commenter.
- Published At: The ISO 8601 timestamp marking when the comment was originally posted, offering insights into the timing and relevance of user interactions. Data ranges from 25th November 2021 to 26th December 2023.
- Likes: The number of likes attributed to the comment, serving as an indicator of its resonance or approval among the community. The majority of comments have between 0 and 49,819 likes, with a few reaching up to 996,380.
- Reply Count: The count of replies to the comment, reflecting its capacity to engage and provoke discussion within the community. Most comments have between 0 and 25 replies, with a few having up to 501 replies.
Distribution
This dataset comprises 100,000 unique comments, typically distributed as a data file in CSV format. Specific numbers for rows or records beyond the 100,000 comment count are not explicitly detailed in the provided information.
Usage
This dataset is a rich resource for various analytical pursuits:
- Sentiment analysis: Understanding the overall sentiment expressed in comments.
- Linguistic trends: Identifying evolving language patterns and popular phrases.
- Engagement patterns: Analysing how viewers interact with content and each other.
- Sociocultural research: Delving into the depth and diversity of viewer conversations to draw insights and patterns from a broad swath of public opinion.
- Academics can explore the dynamics of viral content and public interaction.
- Marketers and content creators can gauge audience response and engagement strategies.
- Training machine learning models in natural language processing (NLP) and understanding, providing real-world text data in a diverse, dynamic context.
Coverage
The dataset covers community interactions globally, reflecting a broad spectrum of public opinion from viewers of MrBeast's video. The time range for the comments spans from 25th November 2021 to 26th December 2023, with data available across this period, including significant concentrations in late 2023.
License
CC-BY-NC
Who Can Use It
- Researchers: For in-depth analysis of online communities and viral phenomena.
- Linguists: To study language use in digital contexts.
- Marketers: To understand audience engagement and sentiment for content creation and strategy.
- Sociocultural Analysts: To explore public discourse and social dynamics.
- Academics: For studying viral content and public interaction.
- Content Creators: To inform their content strategies based on audience response.
- Machine Learning Engineers/Data Scientists: For training NLP models with real-world text data.
Dataset Name Suggestions
- MrBeast YouTube Comments
- Viral YouTube Comment Engagement
- YouTube Squid Game Video Discourse
- MrBeast 100K Comments Data
- YouTube Viewer Interaction Dataset
Attributes
Original Data Source: Mr Beast: Most Viewed YT Video 100K Comments