Digital Communication Dynamics Dataset
Social Media and Networking
Free
About
This dataset, named 'Tate & Morgan Viral Interview: 50K YT Comments', captures public discourse and sentiment around a widely viewed interview [1]. It collects 50,000 comments from YouTube, specifically from the 'Piers Morgan Uncensored' channel, posted in response to discussions between Andrew Tate and Piers Morgan [1]. The comments are raw and unedited, which makes the dataset suitable for sentiment analysis, hate speech detection, and linguistic pattern recognition across a wide range of public opinions and reactions. It is aimed at researchers and data scientists studying digital communication dynamics [1].
Columns
- Comment: The actual text of the user comment from YouTube, presenting raw and unedited expressions of public opinion [1, 2].
- Anonymised Author: SHA-256 hashed identifiers, one per commenter, which protect privacy while still allowing comments by the same author to be linked [1, 2].
- Published At: A timestamp indicating precisely when each comment was posted on YouTube, offering insights into temporal engagement patterns [1, 2].
- Likes: The numerical count of 'likes' received by each comment, serving as a clear indicator of its popularity or resonance with the audience [1, 2].
- Reply Count: The total number of replies to each comment, reflecting its level of engagement or potential controversy [1, 2].
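The 'Anonymised Author' column can be illustrated with a minimal sketch of SHA-256 hashing. The function name `anonymise_author`, the optional `salt` parameter, and the hex-digest output format are assumptions made for illustration; the source does not describe the exact procedure used to produce the identifiers.

```python
import hashlib


def anonymise_author(author_id: str, salt: str = "") -> str:
    """Return a hex SHA-256 digest of an author identifier.

    Hypothetical helper: whether the dataset's authors were salted before
    hashing is not stated in the source, so `salt` defaults to empty.
    """
    return hashlib.sha256((salt + author_id).encode("utf-8")).hexdigest()


# The same input always maps to the same 64-character identifier, so
# comments by one author can still be grouped without exposing the name.
print(anonymise_author("example_channel_id"))
```

Because the hash is deterministic, repeat commenters share one identifier, which is consistent with the column having fewer unique values than the dataset has rows.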
Distribution
The dataset contains approximately 50,000 comments (50K), one per record or row [1]. The data is typically provided as a CSV file [3]. The 'Anonymised Author' column features 48,967 unique identifiers, while the 'Published At' column includes 50,185 unique timestamps [4]. The comments were gathered between 22nd November 2023 and 18th December 2023 [5]. In the 'Likes' column, the vast majority of comments (50,018) received between 0 and 421.90 likes, with a maximum recorded value of 8,438 likes [5]. In the 'Reply Count' column, 49,668 comments had between 0 and 5.85 replies, with the highest reply count observed at 117 [6].
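The figures above can be recomputed from the CSV with a short sketch like the following. The header names, the ISO 8601 timestamp format, and the sample rows are assumptions for illustration and may not match the actual file. The helper `first_bin_upper_edge` reflects an observation rather than a documented fact: the ranges 0 to 421.90 and 0 to 5.85 equal the maxima 8,438 and 117 divided by 20, which is consistent with the first bin of a 20-bin equal-width histogram.

```python
import csv
import io
from datetime import datetime

# Hypothetical sample rows; header names mirror the column list in this
# listing and may differ from the headers in the real CSV file.
SAMPLE_CSV = """Comment,Anonymised Author,Published At,Likes,Reply Count
Great interview,a1,2023-11-22T10:00:00Z,12,1
Not convinced,b2,2023-12-01T08:30:00Z,0,0
Great interview again,a1,2023-12-18T21:15:00Z,3,2
"""


def summarise(csv_text):
    """Compute row count, unique authors, date range, and maxima."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    # Assumes ISO 8601 timestamps with a trailing "Z" for UTC.
    stamps = [
        datetime.fromisoformat(r["Published At"].replace("Z", "+00:00"))
        for r in rows
    ]
    return {
        "rows": len(rows),
        "unique_authors": len({r["Anonymised Author"] for r in rows}),
        "first_comment": min(stamps).date().isoformat(),
        "last_comment": max(stamps).date().isoformat(),
        "max_likes": max(int(r["Likes"]) for r in rows),
        "max_replies": max(int(r["Reply Count"]) for r in rows),
    }


def first_bin_upper_edge(max_value, bins=20):
    """Upper edge of the first equal-width bin over the range [0, max_value]."""
    return max_value / bins


print(summarise(SAMPLE_CSV))
print(first_bin_upper_edge(8438), first_bin_upper_edge(117))
```

To run this against the real file, the `io.StringIO` wrapper could be replaced with an open file handle, after adjusting the header names to whatever the download uses.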
Usage
This dataset is ideally suited for advanced Natural Language Processing (NLP) tasks, including:
- Sentiment analysis to gauge public emotion and opinion [1].
- Hate speech detection for identifying problematic online content [1].
- Socio-linguistic studies exploring language use in digital environments [1].
- Exploring how online discourse shapes and reflects public opinion on prominent figures and current affairs [1].
It is particularly encouraged for academic research and knowledge discovery purposes [1].
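As a starting point for the sentiment analysis task, the sketch below scores comments with a toy word list and weights each score by the comment's like count. The lexicon, the function names, and the `likes + 1` weighting are all hypothetical choices made for illustration; they are not part of the dataset or a recommended method, and a real study would likely use a trained model or an established lexicon.

```python
import re

# Hypothetical toy lexicon, for illustration only.
POSITIVE = {"great", "good", "love", "agree", "respect"}
NEGATIVE = {"bad", "hate", "terrible", "disagree", "awful"}


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def sentiment_score(comment):
    """Count positive tokens minus negative tokens."""
    tokens = tokenize(comment)
    return sum(t in POSITIVE for t in tokens) - sum(t in NEGATIVE for t in tokens)


def like_weighted_mean(scored):
    """Average scores over a list of (score, likes) pairs.

    Each comment gets weight likes + 1, so comments with zero likes
    still contribute to the average.
    """
    total_weight = sum(likes + 1 for _, likes in scored)
    if total_weight == 0:
        return 0.0
    return sum(score * (likes + 1) for score, likes in scored) / total_weight


comments = [("Great interview, love it", 9), ("Terrible and awful", 0)]
scored = [(sentiment_score(text), likes) for text, likes in comments]
print(like_weighted_mean(scored))
```

Weighting by the 'Likes' column is one way to let widely endorsed comments count for more than isolated ones; an unweighted mean would instead treat every commenter equally.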
Coverage
This dataset focuses on public comments from YouTube, specifically from the 'Piers Morgan Uncensored' channel's interviews with Andrew Tate [1]. The data covers the period from 22nd November 2023 to 18th December 2023 [5]. Because YouTube comments can be posted from anywhere, the data is global in scope [7]. No demographic information about commenters is available, as the authors are anonymised [1, 2].
License
CC BY-NC
Who Can Use It
This dataset is intended for:
- Researchers interested in digital communication dynamics, online discourse, and public sentiment [1].
- Data scientists seeking rich data for NLP tasks, particularly sentiment analysis and hate speech detection [1].
- Individuals engaged in academic research and knowledge discovery related to online interactions and public opinion [1].
Dataset Name Suggestions
- YouTube Comment Sentiment Analysis Data
- Online Political Discourse Comments
- Piers Morgan Tate Interview Public Reactions
- Digital Communication Dynamics Dataset
- Social Media Opinion Data
Attributes
Original Data Source: Tate & Morgan Viral Interview: 50K YT Comments