Opendatabay APP

WallStreetBets Reddit Posts Archive

Social Media and Posts

Tags and Keywords

Reddit

Wallstreetbets

Finance

Posts

Social

Free

About

This dataset is a collection of Reddit post submissions from the r/wallstreetbets community, spanning April 2012 to February 2021. It records post titles, scores, and engagement metadata, which support analysis of the community's discussions, sentiment, and trending topics related to finance and investing. The data was extracted using the Pushshift API for Reddit.

Columns

  • id: A unique identifier for each Reddit post.
  • title: The title of the Reddit post.
  • score: The post's score (upvotes minus downvotes, as reported by Reddit).
  • author: The username of the post's author.
  • author_flair_text: Text associated with the author's flair.
  • removed_by: Indicates who removed the post, if applicable.
  • total_awards_received: The total number of awards received by the post.
  • awarders: Information detailing awards received.
  • created_utc: The creation timestamp of the post in Coordinated Universal Time (UTC).
  • full_link: The direct URL link to the Reddit post.
  • num_comments: The total number of comments on the post.
  • over_18: A boolean flag indicating whether the post is marked as Not Safe for Work (NSFW).
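
As an illustration of how these columns might be read, the sketch below converts raw CSV rows into typed Python values using only the standard library. It makes two assumptions that the listing does not confirm: that created_utc holds Unix epoch seconds, and that over_18 is serialised as the strings "True" or "False". The sample row is invented for demonstration and is not taken from the dataset.

```python
import csv
import io
from datetime import datetime, timezone


def parse_row(row):
    """Convert one raw CSV row (all strings) into typed values."""
    return {
        "id": row["id"],
        "title": row["title"],
        "score": int(row["score"]),
        "num_comments": int(row["num_comments"]),
        # Assumption: created_utc is Unix epoch seconds.
        "created": datetime.fromtimestamp(
            int(float(row["created_utc"])), tz=timezone.utc
        ),
        # Assumption: booleans are stored as the text "True" / "False".
        "over_18": row["over_18"].strip().lower() == "true",
    }


def read_posts(fileobj):
    """Read every row from an open CSV file object into typed dicts."""
    return [parse_row(r) for r in csv.DictReader(fileobj)]


# Invented sample row, used only to show the expected shape of the data.
sample = io.StringIO(
    "id,title,score,created_utc,num_comments,over_18\n"
    "abc123,Example post,42,1612137600,7,False\n"
)
posts = read_posts(sample)
```

If the real file encodes timestamps or booleans differently, only parse_row needs to change.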

Distribution

The dataset is provided as a CSV file of 220.16 MB containing over 1.1 million unique post records. A sample file will be uploaded to the platform separately.
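
Because the file is fairly large, it may be convenient to stream it row by row rather than load it all at once. The following is a minimal sketch, assuming the id column is present as described in the Columns section; it counts the total and unique records so the figure above can be checked. The in-memory sample is invented.

```python
import csv
import io


def count_unique_ids(fileobj):
    """Stream a CSV and return (total rows, number of distinct ids)."""
    seen = set()
    total = 0
    for row in csv.DictReader(fileobj):
        total += 1
        seen.add(row["id"])
    return total, len(seen)


# Invented sample with one duplicated id.
sample = io.StringIO("id,title\na1,First\na2,Second\na1,First\n")
total, unique = count_unique_ids(sample)
```

With the downloaded file, the same function could be called on a handle opened with open(path, newline="", encoding="utf-8"), where path is wherever the CSV was saved.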

Usage

This dataset is suitable for analysing social media sentiment towards financial markets, tracking discussion trends within online investment communities, and studying the dynamics of retail investor behaviour. It can be used for research into financial linguistics, community engagement analysis, and for building models to predict market movements based on social chatter.
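
One simple way to track a discussion trend is to count, per month, the posts whose title mentions a given keyword. The sketch below is an assumption-laden starting point rather than a sentiment model: it assumes created_utc is Unix epoch seconds, uses plain case-insensitive substring matching (which will also match the keyword inside longer words), and runs on invented sample rows.

```python
import csv
import io
from collections import Counter
from datetime import datetime, timezone


def monthly_mentions(fileobj, keyword):
    """Count posts per calendar month (UTC) whose title contains keyword."""
    counts = Counter()
    needle = keyword.lower()
    for row in csv.DictReader(fileobj):
        if needle in row["title"].lower():
            # Assumption: created_utc is Unix epoch seconds.
            ts = datetime.fromtimestamp(
                int(float(row["created_utc"])), tz=timezone.utc
            )
            counts[ts.strftime("%Y-%m")] += 1
    return counts


# Invented sample rows for demonstration only.
sample = io.StringIO(
    "id,title,created_utc\n"
    "a,GME to the moon,1611878400\n"
    "b,Holding gme,1612137600\n"
    "c,Thoughts on bonds,1612137600\n"
)
mentions = monthly_mentions(sample, "GME")
```

For ticker symbols, matching on whole words would reduce false positives.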

Coverage

The data covers posts from the r/wallstreetbets subreddit made between April 2012 and February 2021. The community is global and online, so there are no geographical or demographic restrictions beyond those of the Reddit user base.
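
The stated time range can be checked against a downloaded copy by scanning for the earliest and latest timestamps. This is a sketch under the assumption that created_utc holds Unix epoch seconds; the two sample rows are invented to mirror the stated start and end months.

```python
import csv
import io
from datetime import datetime, timezone


def coverage(fileobj):
    """Return (earliest, latest) post datetimes in UTC, or None if empty."""
    earliest = latest = None
    for row in csv.DictReader(fileobj):
        # Assumption: created_utc is Unix epoch seconds.
        ts = int(float(row["created_utc"]))
        if earliest is None or ts < earliest:
            earliest = ts
        if latest is None or ts > latest:
            latest = ts
    if earliest is None:
        return None
    return (
        datetime.fromtimestamp(earliest, tz=timezone.utc),
        datetime.fromtimestamp(latest, tz=timezone.utc),
    )


# Invented sample rows for demonstration only.
sample = io.StringIO(
    "id,created_utc\n"
    "x,1612137600\n"
    "y,1333238400\n"
)
first, last = coverage(sample)
```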

License

CC0: Public Domain

Who Can Use It

  • Data Scientists and Analysts: For sentiment analysis, trend identification, and predictive modelling in finance.
  • Researchers: Studying online communities, social finance, and crowd behaviour influencing markets.
  • Financial Professionals: Gaining insights into retail investor sentiment and identifying emerging investment themes.
  • Academics: Utilising real-world social data for studies in economics, sociology, and computer science.

Dataset Name Suggestions

  • WallStreetBets Reddit Posts Archive
  • WSB Community Discussions (2012-2021)
  • Reddit WallStreetBets Post Submissions
  • Financial Sentiment: WallStreetBets Data
  • Social Trading Trends: Reddit WSB Insights

Attributes

Listing Stats

  • Views: 0
  • Downloads: 0
  • Listed: 06/08/2025
  • Region: Global
  • Universal Data Quality Score (UDQS): 5 / 5
  • Version: 1.0
