Introductory E-commerce Dummy Data
E-commerce & Online Transactions
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
A collection of dummy sales records designed specifically for testing, educational, and introductory data science applications. This product offers a ready-to-use sample set ideal for practicing data manipulation techniques and performing initial statistical analysis using libraries such as NumPy and pandas. It captures essential transaction details related to computer product sales, including metrics on product types, pricing, profitability, and basic customer demographics.
Columns
The dataset contains 12 columns, providing detailed information about each sales event. Key fields include:
- Sale ID: A unique identifier for the transaction, ranging from 1 to 39.
- Contact: The customer contact name (17 unique entries), with Michelle Samms being the most frequently recorded.
- Sex: Categorical data identifying the customer's sex (Male 56%, Female 44%).
- Age: The customer's age, spanning from 23 to 57 years, with a mean of approximately 45.7.
- State: The location of the sale, featuring four unique states, predominantly Pennsylvania (PA) and Ohio (OH).
- Product ID: The identifier for the specific item sold (9 unique types).
- Product Type: Categorization of the item, most commonly Laptop (59%) or Desktop (31%).
- Sale Price: The final price of the product sold, ranging from £400 up to £1.35k, with an average price of £837.
- Profit: The financial return from the sale, spanning £72.10 to £231.
- Lead: The channel that generated the sale, such as Website or Flyer 2.
- Month: The month the sale was recorded (9 unique months, with March and November being the most common).
- Year: The year of the transaction, covering 2018 through 2020.
Distribution
The data is provided as a single CSV file,
ComputerSales.csv, weighing approximately 2.94 kB. The dataset contains 39 distinct records, all of which are valid with no reported missing or mismatched values. The structure is optimal for platform listing as the data file is usually in CSV format.Usage
This dataset is perfectly suited for demonstrating fundamental data manipulation techniques in educational settings, such as performing tutorials using NumPy or pandas. It can be used for introductory tasks in business analytics, generating reports on average sale prices and profits, and executing basic time-series analyses spanning the years 2018 to 2020.
Coverage
The data covers sales recorded between 2018 and 2020. Geographically, transactions are limited to four distinct states in the US. Demographically, the data includes age and sex information, making it suitable for analyses focused on specific age brackets or customer gender distributions.
License
CC0: Public Domain
Who Can Use It
Data Science Students: For learning data loading, cleaning, and basic statistical modelling.
Instructors: To provide small, clean, and predictable sample data for coding exercises and assignments.
Tool Developers: Requiring a reliable, fixed dummy set for application testing and validation.
Dataset Name Suggestions
- Computer Sales Tutorial Records
- Introductory E-commerce Dummy Data
- Computer Product Transaction Log
- Data Science Sample CSV
Attributes
Original Data Source:Introductory E-commerce Dummy Data
Loading...
