E-commerce Sales and Customer Analytics
E-commerce & Online Transactions
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset is designed for optimising an online retail business through customer behaviour analysis. It provides transactional data from UK Merch, a wholesale clothing company that expanded its operations across Europe. The dataset facilitates descriptive sales analysis, customer engagement measurement via cohort analysis, and strategic customer segmentation using the RFM (Recency, Frequency, Monetary) methodology. It addresses challenges faced by businesses in understanding their sales performance, customer loyalty, and how to focus marketing and sales efforts effectively based on data-driven insights. It is particularly useful for learning data cleaning and preprocessing techniques.
Columns
- N° de factura: A unique number assigned to each transaction. A 'c' prefix indicates a cancellation.
- Fecha de factura: The date and time when each transaction was generated.
- ID Cliente: A unique 5-digit integer identifier for each customer.
- País: The country of residence for each customer.
- Cantidad: The quantity of each product (item) involved in a transaction.
- Monto: The total invoice amount, expressed in British Pounds Sterling.
Distribution
The dataset is provided in CSV format and is approximately 1.53 MB in size. It contains 6 columns and approximately 26,000 records, representing sales data. Please note that the 'ID Cliente' column has some missing values, affecting about 14% of the records.
Usage
This dataset is ideal for:
- Performing descriptive sales analytics to assess business performance.
- Conducting customer cohort analysis to measure engagement and retention.
- Applying RFM customer segmentation to identify valuable customer groups.
- Practising data cleaning and preprocessing skills, including handling missing data and outliers.
- Developing business intelligence reports with key performance indicators.
- Understanding customer loyalty and purchasing patterns.
- Informing strategic marketing and sales decisions by identifying target customer segments.
- Visualising sales trends and customer behaviour.
Coverage
The dataset primarily covers sales transactions within the United Kingdom, accounting for 91% of the data. The remaining 9% is spread across 37 other countries, with Germany being the second most frequent at 2%. The data spans approximately one year, from 1st December 2020 to 9th December 2021. The 'ID Cliente' column has 14% missing values.
License
CC0: Public Domain
Who Can Use It
- Data Analysts: To practice data cleaning, create analytical reports, and perform customer segmentation and cohort analysis.
- E-commerce Business Owners/Managers: To gain insights into sales health, customer behaviour, and inform strategic decisions for business growth.
- Business Intelligence Professionals: For developing dashboards and visualisations that summarise key sales metrics.
- Students and Aspiring Data Scientists: As a practical project for learning data preprocessing, analysis, and interpretation techniques in a real-world business context.
Dataset Name Suggestions
- E-commerce Sales and Customer Analytics
- UK Merch Wholesale Transactions
- Customer Behaviour Segmentation Data
- Online Retail Sales Performance
- Business Sales Data (2020-2021)
Attributes
Original Data Source: E-commerce Sales and Customer Analytics