Reddit r/FloridaMan Posts Archive
Reddit & Forum Data
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
A collection documenting all article titles and associated external links published to the Reddit r/FloridaMan community. This resource archives news stories related to the notable "Florida Man" phenomenon dating back to 2014. The posts were not subjected to filtering, meaning the content might include a small amount of harsh language.
Columns
The dataset contains six essential fields detailing the submission metadata and content:
- post_id: The unique identification code assigned to the post by Reddit.
- created_at: The date when the original article was submitted to the Reddit platform.
- score: The community engagement metric, calculated as upvotes minus downvotes.
- title: The headline of the external article submitted to the subreddit.
- posted_by: The Reddit user responsible for posting the article.
- url: The web address linking to the original news article.
Distribution
The data is typically provided as a CSV file, named
florida_man.csv, with a file size of 9.43 MB. The collection totals approximately 42,800 records. Within this structure, the title field contains roughly 41,000 unique article headlines. Community scores vary widely, with the maximum recorded score reaching just over 30,800.Usage
This data is ideally suited for academic research and data projects, including:
- Analysing the correlation between article titles and community engagement (score).
- Tracking submission activity and trends within the online community over time.
- Supporting research into viral media phenomena and headline creation.
- Developing natural language processing (NLP) models focused on text from online communities.
Coverage
The data focuses exclusively on content submitted to the r/FloridaMan subreddit. The time range of the posts spans from 1 January 2014 to 30 April 2022. New data is expected to be added to this collection annually.
License
CC0: Public Domain
Who Can Use It
This material is beneficial for a variety of users:
- Data Scientists: For training and testing NLP algorithms using diverse and unfiltered text data.
- Researchers: To study social media trends, content moderation, and the life cycle of viral narratives.
- Social Media Analysts: To understand online community behaviour and engagement metrics like the post score.
Dataset Name Suggestions
- Reddit r/FloridaMan Posts Archive
- Viral Florida Man Headline Collection
- Florida Man Subreddit Data (2014-2022)
Attributes
Original Data Source: Reddit r/FloridaMan Posts Archive
Loading...
