The Reddit Dataset Dataset
Free
About
Context
Datasets… In a way, the Kaggle community is built around them. You can't analyze data without having it. Here, we aim to create a meta-corpus of datasets posted to Reddit. A dataset dataset, if you will.
Content
This dataset is a comprehensive corpus of all posts and comments made on Reddit's /r/datasets board, from the board's inception through March 1, 2022.
The dataset was procured using SocialGrep.
To preserve users' anonymity and to prevent targeted harassment, the data does not include usernames.
Acknowledgements
We would like to thank Chris Liverani for generously providing the cover image for this dataset.
Inspiration
Datasets are nice - we like our data.
License
CC BY 4.0
Original Data Source: The Reddit Dataset Dataset