Sephora Cosmetics Product Analytics
Product Reviews & Feedback
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset is a collection of product information scraped from the Sephora website, containing details for over 9,000 beauty products. It was created as part of a data science project, focusing on web scraping methods. The dataset provides valuable insights into product characteristics, customer ratings, ingredient lists, and marketing flags from an e-commerce platform. It is particularly useful for exploring the influence of ingredients or brand names on product reception, such as the number of 'loves' or overall ratings.
Columns
- id: The product ID at Sephora's website.
- brand: The brand of the product at Sephora's website.
- category: The category of the product at Sephora's website.
- name: The name of the product at Sephora's website.
- size: The size of the product.
- rating: The rating of the product.
- number_of_reviews: The number of reviews of the product.
- love: The number of people loving the product.
- price: The price of the product.
- value_price: The value price of the product (for discounted products).
- URL: The URL link of the product.
- MarketingFlags: Boolean indicating if the product had marketing flags like 'exclusive' or 'sold online only'.
- MarketingFlags_content: The specific kinds of marketing flags associated with the product.
- options: Options available on the website for the product, such as colours and sizes.
- details: Product details available on the website.
- how_to_use: Instructions for using the product, if available.
- ingredients: The ingredients list of the product, if available.
- online_only: Indicator if the product is sold exclusively online.
- exclusive: Indicator if the product is sold exclusively on Sephora's website.
- limited_edition: Indicator if the product is a limited edition.
- limited_time_offer: Indicator if the product has a limited time offer.
Distribution
The dataset is typically provided in CSV format and is approximately 23.26 MB in size. It comprises 21 columns and contains 9,168 valid records. Sample files are usually updated separately to the platform.
Usage
This dataset is ideal for various analytical tasks, including:
- Analysing the impact of specific ingredients on product ratings and customer engagement (e.g., 'number of loves').
- Investigating the influence of brand reputation on product ratings, independent of ingredient quality.
- Market research in the beauty and cosmetics industry to identify trends, popular products, and pricing strategies.
- Developing predictive models for product success or consumer preferences.
- Understanding e-commerce product characteristics and marketing approaches.
Coverage
The data was collected from the Sephora website. It represents a snapshot of products available at the time of scraping and does not have an expected update frequency, meaning it is a static dataset. No specific geographic or demographic coverage is detailed beyond the source being Sephora.com.
License
CC0: Public Domain
Who Can Use It
- Data Scientists and Analysts: For developing machine learning models related to product attributes, ratings prediction, or market segmentation.
- Beauty Industry Professionals: For market trend analysis, competitive benchmarking, and understanding consumer preferences in cosmetics.
- E-commerce Strategists: To study product listing optimisation, pricing dynamics, and the effectiveness of marketing flags.
- Researchers: Exploring consumer behaviour in the beauty sector or the impact of product information on purchasing decisions.
Dataset Name Suggestions
- Sephora Product Insights Dataset
- Beauty Product Data with Ratings & Ingredients
- Sephora Cosmetics Product Analytics
- E-commerce Beauty Product Data
- Sephora Ratings & Ingredients Database
Attributes
Original Data Source: Sephora Cosmetics Product Analytics