Beginner ML Customer Data
Product Reviews & Feedback
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
Explaining a customer dataset designed for beginners to practice fundamental machine learning problems. The data is tabular and focuses on customer churn attributes. Its primary use case is providing easily accessible information for developing and testing initial ML models. The usability of the dataset is rated at 10.00, and updates are expected annually.
Columns
The dataset includes five columns:
- age: Represents the age of the customer. The values range from 15 to 98, with a mean age of 54.2 and a standard deviation of 25.4.
- gender: Categorical data indicating gender. Female customers account for 58% of the records, and Male customers account for 42%.
- review: Customer feedback, where 'Poor' and 'Good' reviews each make up 36% of the data. The remaining 28% falls under 'Other'.
- education: The customer's qualification level. 'PG' (Post Graduate) is the most frequent category (36%), followed by 'School' (32%).
- purchased: A boolean variable indicating whether the customer purchased a product. False accounts for 52% of the records, and True accounts for 48%.
Distribution
This is a small, tabular dataset contained within a
customer.csv file, approximately 1.23 kB in size. It consists of 5 columns and 50 valid records for every column. Crucially for beginners, there are zero missing or mismatched values reported across all features.Usage
This dataset is ideal for practising machine learning challenges, specifically introductory classification problems. It supports feature engineering exercises and serves as a clean, ready-to-use resource for new data scientists testing basic predictive models, such as predicting customer purchasing behaviour.
Coverage
The dataset is relevant to the United States. The demographic scope is broad, capturing customers across nearly the entire lifespan, with ages ranging from 15 to 98. It includes distinct categorisations for gender, review score, and educational qualification.
License
CC0: Public Domain.
Who Can Use It
Intended users include students seeking introductory data to train their first models, machine learning practitioners focusing on feature engineering methods, and educators looking for a small, perfectly clean sample dataset for teaching basic data analysis and classification tasks.
Dataset Name Suggestions
- Beginner ML Customer Data
- Customer Churn Starter Pack
- US Customer Attribute Dataset
- Foundational Purchase Prediction Data
Attributes
Original Data Source: Beginner ML Customer Data
Loading...
