Opendatabay APP

Jokes Text Dataset

Entertainment & Media Consumption

Tags and Keywords

Arts, Beginner, Text, Intermediate, NLP

Free

About

This dataset is a web-scraped collection of one-liner dad jokes. Each joke typically opens with a statement or question and follows it with a witty response, in the style of classic dad humour [1, 2]. It is intended for applications that need light-hearted text data, including entertainment products and computational humour studies [2].

Columns

The dataset contains a single column:
  • Joke: This column holds the text of each one-liner joke. Each entry is structured as a statement or question followed by its comedic response [1].

Distribution

The dataset is presented in a tabular format, typically a CSV file [3]. It comprises 743 unique joke values [1]. The total number of rows is not stated in the provided information, so it may differ from the unique count [1, 4].
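Because the listing gives only the unique count, a quick check of total versus unique rows is a sensible first step after download. The sketch below is one way to do that with Python's standard library, assuming the CSV has a header row with a single `Joke` column as described above. The in-memory sample rows are made-up placeholders, not entries from the dataset, and `load_jokes` is a hypothetical helper name.

```python
import csv
import io

def load_jokes(handle):
    """Read the 'Joke' column from an open CSV handle, skipping blank entries."""
    reader = csv.DictReader(handle)
    return [
        (row.get("Joke") or "").strip()
        for row in reader
        if (row.get("Joke") or "").strip()
    ]

# In-memory stand-in for the downloaded file (placeholder rows, not real data).
sample_csv = io.StringIO(
    "Joke\n"
    '"Placeholder setup one? Placeholder response one."\n'
    '"Placeholder setup two? Placeholder response two."\n'
    '"Placeholder setup one? Placeholder response one."\n'
)

jokes = load_jokes(sample_csv)
total_rows = len(jokes)
unique_jokes = len(set(jokes))
print(total_rows, unique_jokes)  # prints: 3 2
```

For the real file, the same function can be called on `open(path, newline="", encoding="utf-8")`, where `path` is wherever the download was saved. The encoding is an assumption, since the listing does not state it.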

Usage

This dataset is ideal for various applications, including:
  • Natural Language Processing (NLP) tasks, such as text generation, sentiment analysis, or understanding comedic structures [2].
  • Machine Learning (ML) projects, particularly for beginners exploring text data [2].
  • Developing entertainment applications or chatbots that incorporate humour [2].
  • Academic research into computational humour and language patterns [2].
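For work on comedic structure, a common first step is to separate each joke into its opening part and its response. The sketch below uses a simple heuristic that is an assumption, not something the listing specifies: it splits at the first `?`, `.` or `!` that is followed by whitespace. `split_joke` is a hypothetical helper, and the example strings are placeholders rather than dataset entries.

```python
import re

def split_joke(joke):
    """Split a one-liner into (setup, response).

    Heuristic: break after the first '?', '.' or '!' that is followed by
    whitespace. If no such break exists, the whole text is the setup.
    """
    match = re.search(r"[?.!]\s+", joke)
    if match is None:
        return joke.strip(), ""
    return joke[:match.end()].strip(), joke[match.end():].strip()

setup, response = split_joke("Placeholder question? Placeholder answer.")
print(setup)     # Placeholder question?
print(response)  # Placeholder answer.
```

Jokes that contain abbreviations or multi-sentence setups will be split too early by this rule, so the output is best treated as a rough starting point to review by hand.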

Coverage

The dataset's region coverage is listed as global [5]. It contains only dad jokes in one-liner form. No demographic range or time period is given for the jokes' origin or collection; the only date provided is the listing date, 17/06/2025 [5].

License

CC0

Who Can Use It

This dataset is suitable for:
  • Data scientists and machine learning engineers looking for text data for model training and experimentation.
  • Developers creating applications that require comedic content or text generation.
  • Researchers in linguistics, AI, and humour studies.
  • Beginners in data science and NLP seeking accessible text datasets for learning and practice [2].

Dataset Name Suggestions

  • Dad Jokes Database
  • One-Liner Humour Collection
  • The Dad-A-Base of Jokes
  • Jokes Text Dataset

Attributes

Original Data Source: Dad-A-Base Of Jokes

Listing Stats

Views: 3
Downloads: 0
Listed: 17/06/2025
Region: Global
UDQS (Universal Data Quality Score): 5 / 5
Version: 1.0
