SheIn Clothing Catalogue Dataset
E-commerce & Online Transactions
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset offers an extensive collection of over 109,000 product entries from the SheIn e-commerce website, obtained through web scraping on 11th March 2023. It provides detailed information on various clothing items, making it an invaluable resource for E-commerce analytics in the fashion industry. Similar to the Asos E-Commerce Dataset, this collection facilitates deep dives into online retail trends and product insights.
Columns
The dataset includes the following columns:
- url: The direct link to the item on the SheIn website.
- name: The product's name.
- date: The specific date of web scraping, which is 11th March 2023 for all items.
- SKU: A unique identifier for each product item.
- price: The monetary cost of the item.
- size: The available sizes for the product on the website.
- brand: The brand name of the clothing item.
- description: Additional descriptive text, provided in JSON format, which includes details like available colours, material composition, and care instructions.
- images: Links or references to photographs associated with the item's description.
Distribution
The SheIn E-Commerce Dataset is provided in CSV format (
shein_sample.csv
), with a size of approximately 212.52 MB. It contains information on over 109,000 individual product entries, with approximately 110,000 unique URLs. While most columns are fully populated, some entries may have missing values, for instance: 2 missing values for size
, 63 for brand
, 819 for description
, and 1,789 for images
.Usage
This dataset is ideally suited for various applications within the fashion and e-commerce sectors, particularly for E-commerce analytics. Potential use cases include:
- Analysing fashion trends and product popularity.
- Developing recommendation systems for online shoppers.
- Performing market research and competitor analysis.
- Optimising pricing strategies and inventory management.
- Studying product descriptions and image data for insights into consumer preferences.
Coverage
The data was collected on 11th March 2023, providing a snapshot of SheIn's product catalogue at that specific time. The dataset does not explicitly state a geographical scope, but example URLs suggest a focus on the US market (
https://us.shein.com/
). It captures information on over 109,000 clothing products.License
Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
Who Can Use It
This dataset is valuable for:
- E-commerce analysts seeking to understand market dynamics and product performance in the fashion retail space.
- Data scientists developing machine learning models for product recommendations or trend prediction.
- Market researchers analysing consumer behaviour and competitive landscapes within online fashion.
- Business intelligence professionals looking for data-driven insights to inform strategic decisions.
Dataset Name Suggestions
- SheIn Fashion Product Data
- SheIn E-Commerce Products (March 2023)
- SheIn Clothing Catalogue Dataset
- SheIn Online Retail Data
Attributes
Original Data Source: SheIn Clothing Catalogue Dataset