Social Media Enlightenment Dataset
Knowledge Bundles
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset provides a rich collection of tweets from prominent "self-help" authors on Twitter. It aims to offer insights into wisdom-focused content, allowing for exploration of themes related to improving one's life and achieving success. The data was collected using the Tweepy API and includes tweets, retweets, and retweets with comments from over 40 distinct authors. It is a valuable resource for understanding the dynamics of viral content and common patterns within self-improvement discourse on social media.
Columns
author_name
: The name of the author of the tweet.created_at
: The date and time when the tweet was created, recorded in IST.handle
: The Twitter handle associated with the author.likes
: The total number of likes a tweet had received at the point of data collection.retweets
: The total number of retweets a tweet had accumulated at the point of data collection.tweet_content
: The textual content of the tweet itself.
Distribution
The dataset is typically provided as a data file, often in CSV format. It includes tweets from more than 40 authors. The collection spans a wide timeframe, with tweet creation dates ranging from 16th July 2009 to 23rd September 2019. The dataset contains a substantial number of records, with specific counts for various date ranges indicating significant volume, for example, over 17,000 tweets between March and September 2019 alone. Distributions for likes and retweets reveal varying engagement levels across the tweets.
Usage
This dataset is ideal for:
- Exploring the language and themes of "self-help" tweets.
- Understanding the factors that contribute to a tweet becoming viral.
- Analysing author activity and engagement trends on Twitter.
- Developing Natural Language Processing (NLP) models.
- Studying online communities and social media content trends.
Coverage
The dataset has a global reach. The data was collected between 16th July 2009 and 23rd September 2019. It focuses on "self-help" related content from more than 40 distinct Twitter authors, with a notable presence from authors such as Thomas Sowell (7%) and The Ancient Sage (5%), alongside a large proportion from other authors (88%).
License
CC0
Who Can Use It
This dataset is suitable for:
- Researchers and Academics: For social media studies, linguistic analysis, and behavioural science research.
- Data Scientists and Analysts: For building predictive models, conducting NLP tasks, and extracting insights from unstructured text.
- Content Strategists: To understand popular "self-help" themes and engagement patterns on Twitter.
- Individuals interested in personal development: To explore and understand the words of wisdom shared on social media.
Dataset Name Suggestions
- Twitter Wisdom Collective
- Self-Help Tweets Archive
- Viral Tweet Insights
- Social Media Enlightenment Dataset
- Modern Day Proverbs
Attributes
Original Data Source: The Tweets of Wisdom