Opendatabay APP

Social Media Enlightenment Dataset

Knowledge Bundles

Tags and Keywords

Online, Communities, Text, Literature, NLP


Free

About

This dataset is a collection of tweets from prominent "self-help" authors on Twitter. It offers insight into wisdom-focused content and supports exploration of themes related to improving one's life and achieving success. The data was collected with Tweepy, a Python library for the Twitter API, and includes tweets, retweets, and retweets with comments from over 40 distinct authors. It is a useful resource for studying how content goes viral and which patterns recur in self-improvement discourse on social media.

Columns

  • author_name: The name of the author of the tweet.
  • created_at: The date and time when the tweet was created, recorded in IST.
  • handle: The Twitter handle associated with the author.
  • likes: The total number of likes a tweet had received at the point of data collection.
  • retweets: The total number of retweets a tweet had accumulated at the point of data collection.
  • tweet_content: The textual content of the tweet itself.
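As a minimal sketch of how the columns above might be read into typed records, the following uses only the Python standard library. The sample rows are made-up placeholders rather than real records, and the timestamp format is an assumption, since the listing does not state how created_at is serialised. The Tweet class and the read_tweets helper are hypothetical names introduced for illustration.

```python
import csv
import io
from dataclasses import dataclass
from datetime import datetime

# Hypothetical sample rows; the values are placeholders, not real records.
SAMPLE_CSV = """author_name,created_at,handle,likes,retweets,tweet_content
Example Author,2019-03-01 09:30:00,example_author,120,45,"Placeholder text, with a comma."
Another Author,2019-09-23 18:05:00,another_author,7,2,Second placeholder tweet.
"""


@dataclass
class Tweet:
    author_name: str
    created_at: datetime  # naive timestamp; the listing says values are in IST
    handle: str
    likes: int
    retweets: int
    tweet_content: str


def read_tweets(stream, time_format="%Y-%m-%d %H:%M:%S"):
    """Parse CSV rows into Tweet records. time_format is an assumption."""
    tweets = []
    for row in csv.DictReader(stream):
        tweets.append(
            Tweet(
                author_name=row["author_name"],
                created_at=datetime.strptime(row["created_at"], time_format),
                handle=row["handle"],
                likes=int(row["likes"]),
                retweets=int(row["retweets"]),
                tweet_content=row["tweet_content"],
            )
        )
    return tweets


tweets = read_tweets(io.StringIO(SAMPLE_CSV))
```

To read the downloaded file instead, an open file handle (opened with newline="" as the csv module recommends) could be passed in place of the in-memory sample; the time_format argument may need adjusting to match the actual file.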

Distribution

The dataset is provided as a data file in CSV format. It includes tweets from more than 40 authors, with tweet creation dates ranging from 16th July 2009 to 23rd September 2019. Volume is concentrated in the later years: over 17,000 tweets were created between March and September 2019 alone. Likes and retweets vary widely from tweet to tweet.
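A count for a given date range, such as the March to September 2019 figure quoted above, could be reproduced along the following lines. This is a sketch only: count_in_range is a hypothetical helper, the timestamps are placeholders, and the half-open interval (start included, end excluded) is a design assumption.

```python
from datetime import datetime


def count_in_range(timestamps, start, end):
    """Count timestamps t with start <= t < end (half-open interval)."""
    return sum(1 for t in timestamps if start <= t < end)


# Placeholder timestamps standing in for the parsed created_at column.
created = [
    datetime(2009, 7, 16, 8, 0),
    datetime(2018, 12, 31, 23, 59),
    datetime(2019, 3, 1, 0, 0),
    datetime(2019, 9, 23, 12, 0),
]

# March through September 2019, expressed as [1 March, 1 October).
n = count_in_range(created, datetime(2019, 3, 1), datetime(2019, 10, 1))
```

Using an exclusive end bound avoids having to decide what the last instant of 30 September is.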

Usage

This dataset is ideal for:
  • Exploring the language and themes of "self-help" tweets.
  • Understanding the factors that contribute to a tweet becoming viral.
  • Analysing author activity and engagement trends on Twitter.
  • Developing Natural Language Processing (NLP) models.
  • Studying online communities and social media content trends.
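For the virality use case in the list above, one simple starting point is to flag tweets whose engagement is far above the typical level. The sketch below assumes that "engagement" means likes plus retweets and that "viral" means more than ten times the median; both definitions are illustrative choices, not something the dataset prescribes. The rows are placeholders and flag_viral is a hypothetical helper.

```python
from statistics import median


def flag_viral(rows, factor=10):
    """Return rows whose likes + retweets exceed factor times the median."""
    scores = [r["likes"] + r["retweets"] for r in rows]
    threshold = factor * median(scores)
    return [r for r, s in zip(rows, scores) if s > threshold]


# Placeholder rows using the dataset's column names.
rows = [
    {"handle": "a", "likes": 10, "retweets": 2},
    {"handle": "b", "likes": 8, "retweets": 1},
    {"handle": "c", "likes": 900, "retweets": 300},
    {"handle": "d", "likes": 15, "retweets": 5},
    {"handle": "e", "likes": 11, "retweets": 3},
]

viral = flag_viral(rows)
```

The median is used rather than the mean because a handful of very popular tweets would otherwise pull the threshold upward.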

Coverage

The dataset has a global reach. It covers tweets created between 16th July 2009 and 23rd September 2019 and focuses on "self-help" content from more than 40 distinct Twitter authors. The most represented authors are Thomas Sowell (7% of tweets) and The Ancient Sage (5%); the remaining authors together account for 88%.
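Per-author shares like the percentages above could be derived from the author_name column roughly as follows. The names and counts here are placeholders and author_shares is a hypothetical helper; the sketch shows only the arithmetic.

```python
from collections import Counter


def author_shares(author_names):
    """Map each author to their percentage of all tweets, largest first."""
    counts = Counter(author_names)
    total = sum(counts.values())
    return {name: 100 * n / total for name, n in counts.most_common()}


# Placeholder author_name values: 7, 5 and 8 tweets respectively.
names = ["Author A"] * 7 + ["Author B"] * 5 + ["Author C"] * 8

shares = author_shares(names)
```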

License

CC0

Who Can Use It

This dataset is suitable for:
  • Researchers and Academics: For social media studies, linguistic analysis, and behavioural science research.
  • Data Scientists and Analysts: For building predictive models, conducting NLP tasks, and extracting insights from unstructured text.
  • Content Strategists: To understand popular "self-help" themes and engagement patterns on Twitter.
  • Individuals interested in personal development: To explore and understand the words of wisdom shared on social media.

Dataset Name Suggestions

  • Twitter Wisdom Collective
  • Self-Help Tweets Archive
  • Viral Tweet Insights
  • Social Media Enlightenment Dataset
  • Modern Day Proverbs

Attributes

Original Data Source: The Tweets of Wisdom

Listing Stats

  • Views: 0
  • Downloads: 0
  • Listed: 24/06/2025
  • Region: Global
  • Universal Data Quality Score (UDQS): 5 / 5
  • Version: 1.0