AskScience Subreddit Engagement Data
Social Media and Networking
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset offers an insightful look into the Reddit AskScience subreddit, one of the web's most cherished science communities [1]. It provides key insights into what makes AskScience such an engaging community, filled with passionate science enthusiasts [1]. The data includes details on post titles, scores, unique identifiers, URLs, comment counts, creation times, post bodies, and timestamps [1-3]. By combining this information, one can gain a better understanding of discussants' interests and preferences, and derive valuable lessons for building more robust online communities [1].
Columns
- title: The title of a post. (String) [1-3]
- score: The number of upvotes a post has received. (Integer) [1-3]
- id: The unique identifier of a post. (String) [1-3]
- url: The URL of a post. (String) [1-3]
- comms_num: The number of comments a post has received. (Integer) [1-3]
- created: The date and time that a post was created. (DateTime) [1-3]
- body: The body text associated with a post. (String) [1-3]
- timestamp: The date and time when a post was last updated. (DateTime) [1-3]
Distribution
The dataset is typically provided in a CSV file format [4]. It contains 1289 unique entries across various data points, reflecting post scores, comment counts, and creation timestamps [5-7]. Post scores range from -10 to 5935, with the majority (1,178 posts) falling between -10 and 584.50 [5]. Comment numbers range from 0 to 675, with 1,165 posts having between 0 and 67.50 comments [6]. The data covers a period from 17th October 2022 to 17th December 2022 [7].
Usage
This dataset is ideal for research and analysis, allowing users to:
- Analyse the types of questions, tags, and topics that receive the most upvotes [1].
- Compare engagement levels within the AskScience subreddit over time [1].
- Examine how formatting might influence post engagement and popularity (e.g., bolding titles, using images) [1].
- Understand discussants' interests and preferences to inform the creation of more engaging online communities [1].
Coverage
The dataset's geographic coverage is global [8]. The temporal scope ranges from 17th October 2022 to 17th December 2022 [7]. It captures content and engagement patterns from a community of passionate science enthusiasts [1].
License
CC0
Who Can Use It
This dataset is suitable for:
- Researchers and Academics: To study online community dynamics, social media engagement, and public interest in scientific topics [1].
- Data Analysts: For deriving insights into user behaviour and content performance on social platforms [1].
- Community Managers: To understand effective strategies for fostering enthusiastic and engaging online discussions [1].
- Developers: To build applications that leverage insights from science-focused online communities.
Dataset Name Suggestions
- Reddit AskScience Posts
- AskScience Subreddit Engagement Data
- Science Community Online Discussions
- Reddit r/AskScience Data Analysis
Attributes
Original Data Source: Reddit: /r/AskScience