Transnational Retail Purchase History
Retail & Consumer Behavior
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This transnational dataset captures all retail transactions from a UK-based non-store online retail business, spanning from 1st December 2010 to 9th December 2011. The company specialises in unique all-occasion gifts, and a significant portion of its customer base consists of wholesalers. This dataset offers insights into online purchasing behaviour and transaction patterns across different countries.
Columns
- InvoiceNo: A 6-digit integral number uniquely assigned to each transaction. If the code begins with 'c', it indicates a cancellation.
- StockCode: A 5-digit integral number uniquely assigned to each distinct product or item.
- Description: The name of the product or item. Note that this column has some missing values.
- Quantity: The quantity of each product or item per transaction, represented as a numeric value. Quantities can be negative, likely indicating returns or refunds.
- InvoiceDate: The date and time when each transaction was generated, presented as a numeric value.
- UnitPrice: The price per unit of the product in sterling, a numeric value. This can also include negative values.
- CustomerID: A 5-digit integral number uniquely assigned to each customer. Approximately 25% of the records have missing customer IDs.
- Country: The name of the country where each customer resides. There are 38 unique countries represented, with the United Kingdom accounting for 91% of the data.
Distribution
The dataset is typically provided as a CSV file and has a size of approximately 43.95 MB. It comprises 8 columns and contains over 540,000 transaction records, though approximately 407,000 of these records include a valid Customer ID.
Usage
This dataset is ideal for various analytical applications, including:
- Market Segmentation: Identifying distinct customer groups based on their purchasing behaviour for targeted marketing strategies.
- Customer Behaviour Analysis: Studying customer purchasing patterns, frequency, and monetary value, often used in RFM (Recency, Frequency, Monetary) modelling.
- Sales Forecasting: Predicting future sales trends, identifying best-selling products, and managing inventory.
- Geographic Analysis: Understanding sales distribution and customer origins across different countries to inform international marketing efforts.
- Marketing Strategy Development: Aiding in the evolution and refinement of direct, data, and digital marketing initiatives.
Coverage
The dataset covers transactions from 1st December 2010 to 9th December 2011. Geographically, it encompasses transactions from 38 different countries, with the United Kingdom being the predominant location, representing 91% of all records. The retail company itself is UK-based, and a notable demographic insight is that many of its customers are wholesalers.
License
CC0: Public Domain
Who Can Use It
This dataset is particularly suitable for:
- Data Analysts and Scientists: For exploratory data analysis, developing machine learning models (e.g., clustering algorithms for customer segmentation, recommendation systems), and performing predictive analytics.
- Academic Researchers: For conducting studies in e-commerce trends, retail analytics, consumer behaviour, and various data mining applications.
- Businesses and Retailers: Especially online retailers, to gain actionable insights into their sales performance, customer base, and inventory management.
- Marketing Professionals: To develop more effective and precise marketing campaigns, understand customer engagement, and assess promotional effectiveness.
Dataset Name Suggestions
- Online Retail Transactions 2010-2011
- UK E-commerce Sales Data
- Transnational Retail Purchase History
- Gift Retail Transaction Log
Attributes
Original Data Source: Transnational Retail Purchase History