Opendatabay APP

Market Basket Analysis Introduction Dataset

Retail & Consumer Behavior

Tags and Keywords

Apriori, Retail, Association, Algorithm, Shopping


Free

About

The co-purchase of seemingly unrelated retail products, such as beer and diapers, is a classic entry point into association rule mining. Drawing on a case discussed at industry summits, this collection documents a handful of transactions to illustrate the logic of the Apriori algorithm. With only five records, learners can work through by hand how unrelated items end up in the same shopping cart, which gives a foundation in market basket analysis and in how statistical correlations are interpreted in business intelligence.

Columns

  • TID: The unique identification number assigned to each individual transaction record.
  • item: The specific products or goods purchased by a customer during a single transaction, such as bread or milk.
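As a rough sketch of how the two columns might be read into per-transaction baskets, the example below parses CSV text with Python's standard csv module. The sample rows are hypothetical and are not the contents of the actual file, and the layout of the item column (several comma-separated products in one quoted cell) is an assumption. The load_baskets helper is likewise an illustrative name.

```python
import csv
import io

# Hypothetical sample in the listing's two-column layout (TID, item).
# These rows are illustrative only and are NOT the real file contents.
SAMPLE_CSV = '''TID,item
1,"bread,milk"
2,"bread,diapers,beer,eggs"
3,"milk,diapers,beer,cola"
4,"bread,milk,diapers,beer"
5,"bread,milk,diapers,cola"
'''


def load_baskets(text):
    """Return {TID: set of items} from CSV text with TID and item columns."""
    baskets = {}
    for row in csv.DictReader(io.StringIO(text)):
        # Assumption: one cell may hold several comma-separated products.
        items = {part.strip() for part in row["item"].split(",") if part.strip()}
        # setdefault also copes with a long layout (one item per row per TID).
        baskets.setdefault(row["TID"], set()).update(items)
    return baskets


baskets = load_baskets(SAMPLE_CSV)
for tid, items in baskets.items():
    print(tid, sorted(items))
```

To try this on the downloaded file, the inline string would be replaced with the text of the CSV, after checking how the item column is actually delimited.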

Distribution

The data is delivered as a CSV file titled beer and diaper.csv, with a file size of 136 B. It contains 5 valid records across 2 columns. All entries are valid, with no missing or mismatched values. This is a static resource with a usability score of 10.00, and no future updates are expected.

Usage

This resource is ideal for students and educators seeking a simplified environment in which to calculate by hand, or test programmatically, the Apriori algorithm. It serves as an introductory tool for association rule mining, allowing users to identify frequent itemsets and compute confidence metrics without the complexity of large-scale retail databases. Analysts can also use it to examine the historical "Beer and Diapers" case study and the ambiguity often found in data correlation claims.
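The frequent-itemset and confidence calculations mentioned above can be sketched in a few lines of plain Python. This is a minimal, unoptimized version of the level-wise Apriori idea, not a reference implementation. The five baskets are hypothetical stand-ins rather than the records in the file, and the function names and the 0.6 support threshold are illustrative choices.

```python
from itertools import combinations

# Hypothetical baskets for illustration only; NOT the dataset's actual records.
TRANSACTIONS = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    itemset = frozenset(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)


def confidence(antecedent, consequent, transactions):
    """Estimate of P(consequent | antecedent) from the transactions."""
    both = set(antecedent) | set(consequent)
    return support(both, transactions) / support(antecedent, transactions)


def lift(antecedent, consequent, transactions):
    """Confidence relative to the consequent's baseline support."""
    return confidence(antecedent, consequent, transactions) / support(
        consequent, transactions
    )


def apriori(transactions, min_support):
    """Level-wise search: size-k candidates are built from frequent (k-1)-sets."""
    items = sorted({i for t in transactions for i in t})
    level = [
        frozenset([i]) for i in items if support([i], transactions) >= min_support
    ]
    frequent = {}
    k = 1
    while level:
        for s in level:
            frequent[s] = support(s, transactions)
        k += 1
        # Join step: unions of two frequent sets that have exactly k items.
        candidates = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: every (k-1)-subset must already be frequent.
        level = [
            c
            for c in candidates
            if all(frozenset(sub) in frequent for sub in combinations(c, k - 1))
            and support(c, transactions) >= min_support
        ]
    return frequent


frequent = apriori(TRANSACTIONS, min_support=0.6)
for itemset, s in sorted(frequent.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), round(s, 2))
print("conf(diapers -> beer):", round(confidence({"diapers"}, {"beer"}, TRANSACTIONS), 2))
print("lift(diapers -> beer):", round(lift({"diapers"}, {"beer"}, TRANSACTIONS), 2))
```

In these hypothetical baskets, diapers and beer appear together in 3 of 5 transactions, and diapers appear in 4, so the rule diapers → beer has support 3/5 and confidence 3/4. The reverse rule, beer → diapers, has confidence 3/3, which is a small reminder that confidence is directional and that a high value on five records says little about a real causal link.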

Coverage

The scope is limited to an illustrative sample of 5 transactions, deliberately kept small so that beginners can follow every step. While the concept originates in retail data discussed at professional summits, the records serve as a general model for introductory data mining exercises. The data represents a fixed point in time and does not include demographic or geographic subsets beyond the context of the case study.

License

CC0: Public Domain

Who Can Use It

Data science beginners can leverage these records to practice their first association rules using tools like pandas. Academic instructors can utilise the sample as a clear, manageable example for classroom demonstrations of data mining concepts. Additionally, researchers interested in information management can use the data to revisit and deconstruct the famous "Beer and Diapers" correlation narrative.

Dataset Name Suggestions

  • Apriori Algorithm Starter: Beer and Diapers Case Study
  • Classic Retail Association Rule Mining Sample
  • Beer and Diapers: Small Sample for Data Mining Education
  • Market Basket Analysis Introduction Dataset
  • Beginner-Friendly Apriori Algorithm Exercise Data

Attributes

Listing Stats

  • Views: 2
  • Downloads: 1
  • Listed: 21/12/2025
  • Region: Global
  • UDQS Quality: 5 / 5
  • Version: 1.0
