Curated Life Advice from Reddit
Social Media and Posts
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
Explore a collection of over 13,000 valuable life tips sourced from the popular Reddit communities r/LifeProTips and r/YouShouldKnow. This curated collection captures practical and genuine advice shared by real people, focusing on posts that have received more than 1,000 upvotes. It offers a unique insight into a wide range of popular life topics and common themes discussed within these well-known online forums.
Columns
- id: The unique identifier for the post, assigned by Reddit.
- author: The username of the person who created the post.
- isOver18: A boolean value indicating if the post is marked as Not Safe for Work (NSFW).
- postUrl: The direct URL to the original Reddit post.
- subreddit: The name of the subreddit where the post was published, either 'LifeProTips' or 'YouShouldKnow'.
- postTitle: The title of the Reddit post.
- hasPostBody: A boolean value indicating whether the post includes a body of text.
- postBody: The main text content of the post. This field is null if the post does not have a body.
- score: The total upvote score of the post, as given by users.
- numComments: The total number of comments on the original Reddit post.
Distribution
The data is provided in a single CSV file named
helpfulRedditPosts.csv
with a size of 10.76 MB. It contains approximately 13,100 records, each representing a unique life tip. The dataset has 10 columns.Usage
This dataset is ideal for a variety of text-based analysis and application development projects.
- Web Application Development: Create a web app to display a new, interesting life tip to users daily.
- Data Exploration and Analysis: Investigate the data to identify the most popular categories of life tips and uncover common patterns or themes.
- Recommendation Systems: Build a model that suggests relevant tips to users based on specific topics or life situations.
- AI Model Training: Use the dataset as training material for AI-powered models designed to generate new, useful life tips.
Coverage
The dataset covers posts from two subreddits, r/LifeProTips and r/YouShouldKnow, spanning the years from 2005 to 2022. It contains content from a global user base, reflecting the diverse demographics of Reddit users.
License
CC0: Public Domain
Who Can Use It
- Data Scientists and Analysts: Can perform textual analysis to find trends in social media advice and user engagement.
- App Developers: Can integrate the tips into applications focused on self-improvement, daily motivation, or knowledge sharing.
- AI/ML Engineers: Can use the text data to train natural language processing (NLP) models for text generation or classification tasks.
- Beginners in Data Science: The dataset is well-structured and suitable for beginners looking to practice their data analysis and visualisation skills.
Dataset Name Suggestions
- Reddit's Top Life Pro Tips
- Curated Life Advice from Reddit
- Popular Life Hacks from r/LifeProTips & r/YouShouldKnow
- Helpful Human Advice: A Reddit Dataset
Attributes
Original Data Source: Curated Life Advice from Reddit