Opendatabay APP

Reddit Depression Reports Dataset

Mental Health & Wellness

Tags and Keywords

Data

Exploratory

Nlp

Nltk

Clustering

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Reddit Depression Reports Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset contains user reports about depression collected from various Reddit forums focused on depression-related topics. The data has been anonymised to protect user privacy by removing user IDs and publication dates. It provides a valuable resource for understanding personal experiences and expressions related to mental health, offering insights into depression-related discourse online.

Columns

  • title: This column represents the title of the user report, providing a concise summary or description of the report's content.
  • content: The content column contains the detailed report provided by the user. It may include personal experiences, thoughts, feelings, or any relevant information related to depression.
  • score: The score column represents the rating assigned to the publication by other users. This score indicates the level of engagement, agreement, or relevance as determined by the Reddit community.

Distribution

The dataset is typically provided in CSV format. It consists of approximately 12,456 records or rows. The data has been structured to include three main columns: title, content, and score. It is important to note that the dataset has been anonymised, with user IDs and publication dates removed to safeguard privacy.

Usage

This dataset can be utilised for various applications, including but not limited to:
  • Text analysis and natural language processing (NLP) tasks.
  • Sentiment analysis and emotion detection.
  • Topic modelling and clustering of depression-related content.
  • Depression research and in-depth analysis of user-reported mental health experiences.
  • Machine learning model training and evaluation for tasks such as content classification or trend prediction.

Coverage

  • Geographic Scope: The dataset has a Global reach, as the Reddit platform is accessible worldwide.
  • Time Range: Specific publication dates have been removed to preserve user anonymity, therefore, a defined time range is not available.
  • Demographic Scope: The data consists of user reports from Reddit forums specifically focused on depression-related topics. The user identities are anonymised.

License

CCO

Who Can Use It

This dataset is ideal for:
  • Data analysts and researchers interested in mental health, particularly depression, and social media discourse.
  • Natural Language Processing (NLP) specialists and machine learning engineers working on text analysis, sentiment analysis, or topic modelling projects.
  • Academics and students conducting studies in psychology, sociology, or digital humanities related to online communities and mental wellness.
  • Organisations or individuals focused on public health initiatives and understanding mental health trends through user-generated content.

Dataset Name Suggestions

  • Reddit Depression Reports
  • Anonymised Mental Health Posts
  • Online Depression Discourse Data
  • User-Reported Depression Insights
  • Mental Health Reddit Data

Attributes

Original Data Source: Depression Dataset

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

08/06/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free