Giftware Online Retail Data
E-commerce & Online Transactions
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset provides real transaction data from a UK-based, non-store online retail company, primarily selling unique all-occasion gift-ware. It covers transactions occurring between 1st December 2009 and 9th December 2011. Many of the company's customers are wholesalers. This dataset is valuable for analysing online retail trends, customer behaviour, and sales patterns over a two-year period.
Columns
- InvoiceNo: A 6-digit integral number uniquely assigned to each transaction. If the code begins with 'c', it signifies a cancellation. This nominal column has 53,628 unique values and 1.07 million valid entries, with no missing data.
- StockCode: A 5-digit integral number uniquely assigned to each distinct product. This nominal column contains 5,305 unique values and 1.07 million valid entries, with no missing data. The most common stock code is 85123A.
- Description: The name of the product or item. This nominal column has 5,699 unique values and 1.06 million valid entries, with 4,382 missing values. The most common description is "WHITE HANGING HEART T-LIGHT HOLDER".
- Quantity: The quantity of each product or item per transaction. This numeric column has 1.07 million valid entries and no missing data. Quantities range from -80,995 to 81,000, with a mean of 9.94. Negative quantities indicate returns or cancellations.
- InvoiceDate: The date and time when a transaction was generated. This numeric column has 1.07 million valid entries and no missing data, covering the period from 1st December 2009 to 9th December 2011.
- UnitPrice: The product price per unit in sterling (£). This numeric column has 1.07 million valid entries and no missing data. Unit prices range from -53,600 to 39,000, with a mean of 4.65.
- CustomerID: A 5-digit integral number uniquely assigned to each customer. This nominal column has 824,000 valid entries, with 243,000 missing values (23%). It contains 5,942 unique customer IDs. The customer IDs range from 12,300 to 18,300, with a mean of 15,300.
- Country: The name of the country where a customer resides. This nominal column has 1.07 million valid entries and no missing data, with 43 unique countries represented. The United Kingdom accounts for 92% of the records.
Distribution
This dataset is provided in CSV format, with a file size of 94.85 MB. It contains 8 columns and approximately 1.07 million records. The 'Customer ID' column has a significant number of missing values, accounting for 23% of the total records.
Usage
This dataset is ideal for various analytical applications, including:
- Customer segmentation and profiling using RFM (Recency, Frequency, Monetary) models.
- Predicting customer profitability over time.
- Analysing sales trends and patterns across different products and time periods.
- Market basket analysis to identify product associations.
- Developing and testing predictive models for retail forecasting.
- Educational purposes in data mining, business analytics, and economics.
Coverage
The dataset covers transactions from a UK-based online retailer. Geographically, while primarily focused on the United Kingdom (92% of records), it includes transactions from 42 other countries. The temporal scope spans two years, from 1st December 2009 to 9th December 2011. There are no specific demographic details provided beyond the note that many customers are wholesalers.
License
CC0: Public Domain
Who Can Use It
This dataset is suitable for:
- Data Scientists and Analysts: For building predictive models, conducting in-depth retail analytics, and exploring customer behaviour.
- Academics and Students: For research, case studies, and learning about transactional data analysis, data mining, and business intelligence.
- Retail Businesses: For gaining insights into their sales performance, customer base, and product popularity.
Dataset Name Suggestions
- UK Online Retail Transactions 2009-2011
- E-commerce Sales UK Dataset
- Giftware Online Retail Data
- Wholesale & Retail Transactions UK
Attributes
Original Data Source: Giftware Online Retail Data