Opendatabay APP

Tales from the Pizza Guy Subreddit Data

Data Science and Analytics

Tags and Keywords

Reddit

Pizza

Delivery

Nlp

Stories

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Tales from the Pizza Guy Subreddit Data Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This collection provides a valuable snapshot of interactions and stories revolving around the pizza delivery industry, curated from the popular r/TalesFromThePizzaGuy subreddit. It consists of posts and accompanying comments, captured automatically and updated regularly. The data is ideally suited for natural language processing (NLP) tasks and for researchers interested in analysing common themes, sentiment, and narratives within service industry online communities.

Columns

  • title: The title of the original post on the subreddit.
  • Comment: A classification field indicating the nature or type of record entry.
  • score: The aggregated popularity score assigned to the post or comment, ranging widely from negative values to over 1200, with a mean score around 51.7.
  • id: The unique identifier assigned to each record. There are 3927 unique IDs present.
  • url: The specific web address link to the discussion thread. Approximately 73% of records have a missing URL.
  • comms_num: The total count of comments associated with the entry, with a maximum of 434 comments recorded for a single entry and an average of 9.11 comments.
  • created: The Unix timestamp representing when the entry was created.
  • body: The main text content of the record (the primary text of the post or comment). Only 1% of records are missing body content.
  • timestamp: A human-readable date and time representation of when the entry was created.

Distribution

The data is delivered in a standard CSV file format, named tales_from_the_pizza_guy.csv, with a file size of 2.24 MB. The structure includes 8 distinct columns. The collection contains 3927 valid records. The data is expected to be updated daily.

Usage

This data product is perfect for several applications:
  • NLP practitioners looking to test and refine models on specific domain language and service industry jargon.
  • Creating entertaining analyses and notebooks focused on common tropes or humorous narratives.
  • Sentiment analysis on customer and driver interactions.
  • Research into the discourse and experiences of essential service workers.

Coverage

The scope of this collection is strictly limited to posts and comments originating from the r/TalesFromThePizzaGuy subreddit community. The time range covered spans from September 8, 2020, through to October 7, 2022. The data focuses on general narratives related to pizza delivery, without specific geographic or demographic segmentation noted in the fields.

License

CC0: Public Domain

Who Can Use It

  • Data Scientists and NLP Specialists: Utilising the text for machine learning models, topic modelling, and lexical analysis.
  • Social Media Researchers: Studying community engagement and narrative structure in service industry forums.
  • Content Creators: Generating insights or articles based on funny, bizarre, or interesting real-life service sector stories.

Dataset Name Suggestions

  • Pizza Delivery Driver Stories
  • Reddit Pizza Guy Chronicles
  • Service Industry Tales: Pizza Edition
  • Tales from the Pizza Guy Subreddit Data

Attributes

Listing Stats

VIEWS

2

DOWNLOADS

0

LISTED

20/11/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Loading...

Free

Download Dataset in CSV Format