Reddit Bitcoin Comments Dataset
Data Science and Analytics
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset offers a window into user perspectives on one of the world's most popular cryptocurrencies, Bitcoin. It contains user comments from the Bitcoin Subreddit, spanning from early 2020 until now, providing insights into user conversations, topics discussed, and sentiments expressed within this vibrant online community. It is a valuable resource for breaking down comments based on time, replies, and score to gain unique insights, follow trends over time, and identify primary hot topics that excite the Bitcoin subreddit.
Columns
- title: The title of the comment. (String)
- score: The amount of upvotes received by the comment. (Integer)
- url: The link to the individual Reddit page where a user can view all replies/responses associated with their initial post/comment. (String)
- comms_num: The number of replies made regarding a particular initial post/comment. (Integer)
- created: Date & time when the comment was initially posted. (DateTime)
- body: Main content text provided in each individual post/comment. (String)
- timestamp: Time stamp converted into a local US zone setting. (DateTime)
Distribution
The data file is typically in CSV format. It contains comments from the Bitcoin Subreddit. While a single total row count is not specified, examples of data distribution include score ranges from -9.00 to 4304.00, with 1,852 records in the -9.00 to 422.30 range. Timestamp data is provided in specific bins, for example, from 1670500766.00 to 1670593199.10 containing 81 records, and daily counts such as 979 records for 12/18/2022 - 12/19/2022.
Usage
This dataset is ideal for various applications, including:
- Conducting sentiment analysis of Bitcoin Subreddit comments to examine the public's perception of cryptocurrency.
- Identifying and visualising correlations between Reddit comments and changes in the value of Bitcoin cryptocurrency markets over time.
- Identifying user trends in topic preferences for Bitcoin discussions on Reddit by analysing the body content, topics discussed, and URL associated with each comment. A working understanding of statistical concepts such as descriptive statistics, central tendency, and distributions, as well as basic SQL queries, is helpful for utilising this data effectively.
Coverage
The dataset covers user comments from the Bitcoin Subreddit. Its time range spans from early 2020 until now. Geographic scope is global, reflecting the nature of Reddit. Specific examples of data availability are shown for daily periods in December 2022.
License
CC0
Who Can Use It
This dataset is valuable for:
- Data Scientists and Analysts: To gain unique insights into user conversations, topics, and sentiments in the Bitcoin community.
- Researchers: For studying cryptocurrency market dynamics, public perception, and online community behaviour.
- Developers: To build applications that track or analyse cryptocurrency discussions.
Dataset Name Suggestions
- Reddit Bitcoin Comments Dataset
- Bitcoin Subreddit Activity Log
- Cryptocurrency Discussion Data
- Bitcoin User Perspectives
Attributes
Original Data Source: Reddit: /r/Bitcoin