Jokes Text Dataset
Entertainment & Media Consumption
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset is a web-scraped collection of one-liner dad jokes, curated to provide a humorous and relatable text resource. Each joke typically features a first part as a statement or question, followed by a witty response, characteristic of classic dad humour [1, 2]. It is designed for applications where engaging and light-hearted text data is required, offering a unique resource for entertainment and computational humour studies [2].
Columns
The dataset contains a single column:
- Joke: This column holds the text of each one-liner joke. Each entry is structured as a statement or question followed by its comedic response [1].
Distribution
The dataset is presented in a tabular format, typically a CSV file [3]. It comprises 743 unique joke values, offering a diverse yet focused collection of one-liners [1]. Specific row or record counts beyond unique values are not detailed in the provided information [1, 4].
Usage
This dataset is ideal for various applications, including:
- Natural Language Processing (NLP) tasks, such as text generation, sentiment analysis, or understanding comedic structures [2].
- Machine Learning (ML) projects, particularly for beginners exploring text data [2].
- Developing entertainment applications or chatbots that incorporate humour [2].
- Academic research into computational humour and language patterns [2].
Coverage
The dataset is stated to have a global region coverage [5]. It focuses exclusively on dad jokes in the form of one-liners, without specifying particular demographic ranges or time periods for the jokes' origin or collection beyond its creation date listed as 17/06/2025 [5].
License
CCO
Who Can Use It
This dataset is suitable for:
- Data scientists and machine learning engineers looking for text data for model training and experimentation.
- Developers creating applications that require comedic content or text generation.
- Researchers in linguistics, AI, and humour studies.
- Beginners in data science and NLP seeking accessible text datasets for learning and practice [2].
Dataset Name Suggestions
- Dad Jokes Database
- One-Liner Humour Collection
- The Dad-A-Base of Jokes
- Jokes Text Dataset
Attributes
Original Data Source: Dad-A-Base Of Jokes