Soccer GOAT Debate Tweets
Sports & Recreation
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset comprises tweets related to "GOAT" (Greatest of All Time), specifically focusing on football players such as Messi, Ronaldo, or Mbappé. The data was collected using a daily scheduled script via Tweepy, employing hashtags for targeted football players and "GOAT" as collection criteria. Its primary purpose is to enable analysis to determine which football player is most positively perceived, has a larger follower base, or is more likely to be considered the overall best.
Columns
- id: Unique identifier for the tweet.
- user_name: The user's display name on the platform.
- user_location: The geographical location specified by the user.
- user_description: The biographical text provided by the user.
- user_created: The date and time when the user's account was created.
- user_followers: The number of followers the user has.
- user_friends: The number of accounts the user is following.
- user_favourites: The total number of tweets the user has liked.
- user_verified: A boolean indicating if the user's account is verified.
- date: The date the tweet was posted.
Distribution
The dataset typically comes in a CSV file format. It contains 22,190 individual tweets (records). Data collected through the daily script ranges from 21st January 2023 to 25th March 2023. User account creation dates within the dataset span a much wider period, from 24th November 2006 to 25th March 2023. The majority of user locations (57%) and descriptions (86%) are categorised as 'Other', while 97% of user accounts are not verified.
Usage
This dataset is ideal for:
- Analysing public sentiment towards different football players to identify who is considered the "Greatest of All Time".
- Investigating fan engagement and follower demographics related to top athletes.
- Social media trend analysis within the sports domain.
- Natural Language Processing (NLP) tasks such as sentiment analysis, topic modelling, and keyword extraction on sports-related text.
- Developing machine learning models to predict player popularity or identify key influencers.
Coverage
The dataset offers global coverage as tweets are collected internationally. The tweets themselves were collected from 21st January 2023 to 25th March 2023. However, the user accounts from which these tweets originate were created between 24th November 2006 and 25th March 2023, providing a historical context for user activity. There are no specific demographic notes beyond the user verification status, where 97% of users are unverified.
License
CC0
Who Can Use It
- Sports Analysts and Journalists: To gauge public opinion and narrative around top football players.
- Researchers: Studying social media dynamics, fan behaviour, and sports discourse.
- Data Scientists and NLP Engineers: For training and testing models related to text analysis and sentiment classification.
- Marketing Professionals: Interested in understanding brand perception and audience engagement in the sports industry.
- Football Clubs and Player Agents: To monitor public sentiment and manage player image.
Dataset Name Suggestions
- Football GOAT Tweets
- Greatest Of All Time Football Tweets
- Soccer GOAT Debate Tweets
- Top Footballer Tweet Analysis
- Athlete Sentiment Tweets
Attributes
Original Data Source: GOAT Tweets