Supermarket Product Scrape Data
Retail & Consumer Behavior
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
Offers detailed retail pricing intelligence derived from major UK supermarket websites. It provides essential variables needed for detailed competitive analysis, strategic decision-making, and modeling consumer preferences in the food and health product sectors. The data collection process is rigorous, involving customized functions for systematic web navigation and dedicated data quality scripts to validate extracted information, ensuring a high level of data integrity.
Columns
supermarket: Identifies the retailer associated with the product listing. The sample file, All_Data_ASDA.csv, currently contains one unique value: ASDA.prices_(£): The price of the product listed in pounds sterling. Prices range widely, from a minimum of £0.05 up to £480. The average price is £5.75.prices_unit_(£): The calculated price based on the standard unit of measure, facilitating direct comparisons across product sizes. Values span a large range, extending up to 99.8k.unit: Specifies the physical unit used for pricing, such as 'kg' (43% of records) or 'unit' (35% of records).names: The full product name or description. The dataset features nearly 28.8k unique product names.date: The date when the data was extracted, typically formatted as YYYYMMDD. Sample dates are concentrated within January 2024.category: The general product grouping. 'food_cupboard' (21%) and 'health_products' (16%) are the most frequently occurring categories.own_brand: A boolean indicator identifying whether the product is a store’s own label (True, 30% of records) or a third-party brand (False, 70% of records).
Distribution
The data files are typically provided in CSV format. The sample file, All_Data_ASDA.csv, is 46.09 MB and holds approximately 539k records across 8 columns. Although rigorous quality assurance is applied, the data exhibits very few missing values; for instance, two records are missing from
prices_(£) and sixteen from names and own_brand.Usage
- Uncovering market trends through monitoring price changes and calculating weekly category averages.
- Gaining insight into pricing psychology and competitive tactics using advanced analysis.
- Supporting strategic decision-making in inventory management and promotional strategy.
- Building recommendation engines for personalized product suggestions, often leveraging machine learning techniques like Singular Value Decomposition.
- Creating visualisations, such as tables, graphs, word clouds, and treemaps, to illustrate pricing patterns and brand popularity.
Coverage
The data covers major UK supermarket retailers (Aldi, ASDA, Morrisons, Sainsbury's, and Tesco). The time frame detailed in the samples is concentrated within January 2024. The scope includes diverse product categories, primarily focusing on food items and health goods. The expected update schedule for this dataset is annually.
License
CC0: Public Domain
Who Can Use It
- Retail Strategists: To benchmark current store pricing against key competitors and identify areas for optimization based on observed Retail Analytics Trends.
- Data Analysts: For structured data manipulation and transformation needed to reveal pricing patterns and brand distributions.
- Developers: For integrating current pricing information into external applications or interfaces, potentially deployed via cloud services like Streamlit.
Dataset Name Suggestions
- UK Retail Price Dynamics
- Supermarket Product Scrape Data
- Grocery Pricing Intelligence
- UK Retail Market View
Attributes
Original Data Source: Supermarket Product Scrape Data
Loading...
