Statistical AB Test Practice Set
Data Science and Analytics
Free
About
This dataset provides user-level aggregated data from a split (A/B) experiment, intended for practising statistical analysis and for interview preparation. It includes key metrics of user behaviour in the experiment: the number of views and the number of clicks per user, together with the assigned group. Parameters such as success rate, uplift, beta, and skew were used in its preparation.
Columns
- user_id: A unique identifier for each user. There are 130,000 unique user IDs, ranging from 1 to 130,000, with a mean value of approximately 65,000.
- group: Indicates the treatment variant assigned to the user. This column has two unique values; 'control' accounts for 50% of the entries.
- views: Represents the number of experiment functionality views. The mean is 5.01 views per user, and the maximum recorded for a single user is 334.
- clicks: Denotes the number of experiment functionality clicks. The average clicks per user is 0.29, and the maximum clicks recorded for a single user is 23.
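As a minimal sketch of how these four columns might be summarised per group, the following uses only the Python standard library. The sample rows are made up, the second group label 'test' is an assumption (the listing only names 'control'), and the function name summarize_by_group is hypothetical. Values are parsed with float() in case the file stores counts as decimals.

```python
import csv
import io
from collections import defaultdict

# Tiny in-memory stand-in for the real CSV. The values are made up, and the
# group label "test" is an assumption: the listing only names 'control'.
SAMPLE_CSV = """user_id,group,views,clicks
1,control,3,0
2,test,10,1
3,control,1,0
4,test,2,1
5,control,6,1
"""


def summarize_by_group(rows):
    """Return {group: {"users", "views", "clicks", "ctr"}} from row dicts."""
    totals = defaultdict(lambda: {"users": 0, "views": 0.0, "clicks": 0.0})
    for row in rows:
        t = totals[row["group"]]
        t["users"] += 1
        t["views"] += float(row["views"])
        t["clicks"] += float(row["clicks"])
    summary = {}
    for group, t in totals.items():
        # Guard against a group with zero total views.
        ctr = t["clicks"] / t["views"] if t["views"] else float("nan")
        summary[group] = {**t, "ctr": ctr}
    return summary


summary = summarize_by_group(csv.DictReader(io.StringIO(SAMPLE_CSV)))
for group, stats in sorted(summary.items()):
    print(group, stats)
```

To run this against the actual file, replace the io.StringIO wrapper with an open file handle for the downloaded CSV.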
Distribution
The dataset is provided in CSV format and has a file size of 2.7 MB. It consists of user-aggregated split experiment data, with one row per user. All columns contain 130,000 valid records, with no mismatched or missing values, indicating a complete and well-structured dataset.
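The completeness claim can be checked after download. The sketch below is one possible set of checks, not an official validation procedure: it flags empty fields, repeated user IDs, and non-numeric or negative counts. The function name validate_rows and the sample strings are hypothetical, as is the 'test' group label.

```python
import csv
import io

EXPECTED_COLUMNS = ["user_id", "group", "views", "clicks"]


def validate_rows(rows):
    """Return a list of problems found; an empty list means every check passed."""
    problems = []
    seen_ids = set()
    for line_no, row in enumerate(rows, start=2):  # line 1 is the header
        missing = [c for c in EXPECTED_COLUMNS if not (row.get(c) or "").strip()]
        if missing:
            problems.append(f"line {line_no}: missing {', '.join(missing)}")
            continue
        if row["user_id"] in seen_ids:
            problems.append(f"line {line_no}: duplicate user_id {row['user_id']}")
        seen_ids.add(row["user_id"])
        for col in ("views", "clicks"):
            try:
                value = float(row[col])
            except ValueError:
                problems.append(f"line {line_no}: non-numeric {col}")
                continue
            if value < 0:
                problems.append(f"line {line_no}: negative {col}")
    return problems


# Made-up samples: one clean, one with a duplicate ID and an empty field.
GOOD = "user_id,group,views,clicks\n1,control,3,0\n2,test,10,1\n"
BAD = "user_id,group,views,clicks\n1,control,3,0\n1,test,2,1\n2,test,,1\n"

good_problems = validate_rows(csv.DictReader(io.StringIO(GOOD)))
bad_problems = validate_rows(csv.DictReader(io.StringIO(BAD)))
print(good_problems)
print(bad_problems)
```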
Usage
This dataset is ideally suited for preparing for interviews and practising statistical tests. It offers a practical scenario for applying various statistical methods to analyse A/B test results.
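One test that could be practised here is a comparison of mean clicks per user between the two groups. The sketch below uses Welch's unequal-variance standard error with a normal approximation for the p-value, which is reasonable for large groups (a 50% split of 130,000 users gives roughly 65,000 per group) but rough for the tiny made-up lists shown. The function name welch_z_test and the sample values are illustrative assumptions, not part of the dataset.

```python
from math import sqrt
from statistics import NormalDist, fmean, variance


def welch_z_test(a, b):
    """Two-sided test of mean(b) - mean(a) using a normal approximation.

    Returns (difference in means, z statistic, p-value).
    Both samples need at least two values and non-zero combined variance.
    """
    diff = fmean(b) - fmean(a)
    se = sqrt(variance(a) / len(a) + variance(b) / len(b))
    z = diff / se
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return diff, z, p_value


# Made-up per-user click counts, for illustration only.
control_clicks = [0, 0, 1, 0, 2, 0, 0, 1]
test_clicks = [1, 0, 2, 1, 3, 0, 1, 2]

diff, z, p_value = welch_z_test(control_clicks, test_clicks)
print(f"diff={diff:.3f} z={z:.3f} p={p_value:.4f}")
```

Click counts per user are typically skewed, so it is worth comparing this result with a rank-based or resampling-based test as part of the exercise.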
Coverage
The sources do not provide specific details regarding the geographic, temporal, or demographic scope of the data.
License
CC0: Public Domain
Who Can Use It
This dataset is primarily intended for data professionals, analysts, and students who wish to:
- Hone their skills in A/B test analysis.
- Prepare for data science and analytics interviews by working through real-world aggregated data scenarios.
- Practise various statistical tests and hypothesis testing on user behaviour data.
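Because click-through rate (total clicks divided by total views) is a ratio metric, a bootstrap over users is another technique worth practising on this kind of data. The following is a sketch under stated assumptions: users are the resampling unit, each user is a (views, clicks) pair, every user has at least one view, and the interval is a plain percentile interval. The names ctr and bootstrap_ctr_diff, and all sample values, are hypothetical.

```python
import random


def ctr(users):
    """Click-through rate for a list of (views, clicks) pairs."""
    views = sum(v for v, _ in users)
    clicks = sum(c for _, c in users)
    return clicks / views


def bootstrap_ctr_diff(control, test, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for ctr(test) - ctr(control).

    Users are resampled with replacement within each group.
    """
    rng = random.Random(seed)  # fixed seed so the result is reproducible
    diffs = []
    for _ in range(n_boot):
        c = rng.choices(control, k=len(control))
        t = rng.choices(test, k=len(test))
        diffs.append(ctr(t) - ctr(c))
    diffs.sort()
    lower = diffs[int((alpha / 2) * n_boot)]
    upper = diffs[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


# Made-up (views, clicks) pairs; every user has at least one view.
control_users = [(3, 0), (1, 0), (6, 1), (4, 0), (2, 0), (8, 1)]
test_users = [(10, 1), (2, 1), (5, 1), (3, 0), (7, 2), (1, 0)]

lower, upper = bootstrap_ctr_diff(control_users, test_users)
print(f"95% interval for CTR difference: [{lower:.3f}, {upper:.3f}]")
```

If the interval excludes zero, the resampled data are inconsistent with no difference in click-through rate at the chosen level; with only a handful of users, as here, the interval will usually be wide.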
Dataset Name Suggestions
- AB Test User Behaviour Metrics
- Split Experiment Aggregated Data
- Statistical AB Test Practice Set
- User Engagement AB Test
Attributes
Original Data Source: Statistical AB Test Practice Set