Avocado Price and Sales Volume Augmentation
Product Reviews & Feedback
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This augmented dataset offers historical information on Hass avocado pricing and corresponding sales volume across numerous US markets. The data is derived from weekly retail scan information, capturing actual sales figures directly from retailers' cash registers. It provides context for understanding regional price variations and exploring market phenomena, such as price volatility and the economic choices faced by consumers purchasing staple items like avocado toast. The original data, downloaded in May 2018, has been further expanded using CTGAN augmentation techniques.
Columns
The data product contains 14 columns, detailing various metrics:
- Date: The date of the market observation.
- AveragePrice: The average cost per single avocado unit, even when units are sold in bulk bags (mean price is 1.33, ranging from 0.4 to 3.1).
- type: Categorises the observation as either conventional or organic produce.
- year: The calendar year of the observation.
- Region: The specific city or geographical region where the retail observation was made.
- Total Volume: The overall count of avocados sold.
- 4046, 4225, 4770: The total unit count of avocados sold corresponding to specific Product Lookup codes (PLUs).
- Total Bags, Small Bags, Large Bags, XLarge Bags: Detailed volume counts for avocados sold packaged in bags of differing sizes.
- Unnamed: 0: An unused column included in the file structure.
Distribution
This product is structured as a single CSV file, Augmented_avocado.csv, with a size of 237.46 MB. It contains 1,000,000 total records (1000k valid entries). The data captures market dynamics on a weekly basis, reflecting national retail volume. The observations span a period from January 2015 up to March 2018. All records relate exclusively to Hass avocados.
Usage
This data product is highly suitable for tasks requiring analysis of agricultural commodity pricing and consumer behaviour. Potential applications include:
- Regional Economic Analysis: Identifying markets where prices are lowest to explore regional cost of living insights.
- Price Volatility Studies: Analysing historical price fluctuations, such as determining if the "Avocadopocalypse of 2017" was a demonstrable event.
- Demand Forecasting: Utilising historical sales volume metrics (Total Volume, bag sizes, PLU categories) for predictive modelling.
- Data Augmentation Practice: Leveraging the CTGAN augmented data for training machine learning models.
Coverage
The data spans from the beginning of 2015 through the first quarter of 2018. Geographic coverage includes multiple US markets, featuring 54 unique cities or regions. The retail scan data aggregates sales across multiple channels, including grocery, mass, club, drug, dollar, and military outlets. It distinguishes between conventional and organic avocado types.
License
CC0: Public Domain
Who Can Use It
- Market Researchers: To analyse regional supply, demand, and price variations.
- Data Scientists: To build predictive models for commodity pricing and retail volume based on augmented data.
- Economists: To study market efficiency and consumer sensitivity to price changes over time.
- Business Intelligence Professionals: To benchmark retail performance across different regions and sales channels.
Dataset Name Suggestions
- US Retail Hass Avocado Market Data (2015-2018)
- Avocado Price and Sales Volume Augmentation
- US Regional Avocado Pricing and Consumption
Attributes
Original Data Source: Avocado Price and Sales Volume Augmentation
Loading...
