Avocado Price and Sales Volume Augmentation

Tags and Keywords

Avocado, Price, Sales, Retail, Volume

Free

About

This augmented dataset offers historical information on Hass avocado pricing and corresponding sales volume across numerous US markets. The underlying data is derived from weekly retail scan information, which captures actual sales figures directly from retailers' cash registers. It provides context for understanding regional price variations and for exploring market phenomena such as price volatility and the economic choices facing consumers of popular items like avocado toast. The original data, downloaded in May 2018, has been expanded using CTGAN (Conditional Tabular GAN), a generative model that synthesises additional tabular rows.

Columns

The data product contains 14 columns, detailing various metrics:
  • Date: The date of the market observation.
  • AveragePrice: The average price of a single avocado, including avocados sold in bags (mean 1.33, range 0.4 to 3.1).
  • type: Categorises the observation as either conventional or organic produce.
  • year: The calendar year of the observation.
  • Region: The specific city or geographical region where the retail observation was made.
  • Total Volume: The overall count of avocados sold.
  • 4046, 4225, 4770: The number of avocados sold under each of these Price Look-Up (PLU) codes.
  • Total Bags, Small Bags, Large Bags, XLarge Bags: Detailed volume counts for avocados sold packaged in bags of differing sizes.
  • Unnamed: 0: An unused column, apparently a row index left over from an earlier export (this is the name pandas assigns to an unlabelled column).
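
The column list above can be turned into a small loading routine. The following is a minimal sketch using only Python's standard library: it drops the unused Unnamed: 0 column and converts the remaining fields to typed values. The column names come from the list above; the ISO date format (YYYY-MM-DD), the helper names clean_row and read_avocado_rows, and the two sample rows are assumptions made for illustration and are not taken from the actual file.

```python
import csv
import io
from datetime import date

# Numeric columns, named as in the column list above.
NUMERIC_COLUMNS = [
    "AveragePrice", "Total Volume", "4046", "4225", "4770",
    "Total Bags", "Small Bags", "Large Bags", "XLarge Bags",
]


def clean_row(raw):
    """Convert one csv.DictReader row into typed values."""
    row = dict(raw)
    row.pop("Unnamed: 0", None)  # drop the unused column if present
    # Assumption: dates are stored as ISO strings (YYYY-MM-DD).
    row["Date"] = date.fromisoformat(row["Date"])
    # int(float(...)) tolerates values written as "2015" or "2015.0".
    row["year"] = int(float(row["year"]))
    for name in NUMERIC_COLUMNS:
        row[name] = float(row[name])
    return row


def read_avocado_rows(handle):
    """Yield cleaned rows one at a time so the whole file is never held in memory."""
    for raw in csv.DictReader(handle):
        yield clean_row(raw)


# Made-up sample rows for illustration only; not taken from the dataset.
SAMPLE = """Unnamed: 0,Date,AveragePrice,Total Volume,4046,4225,4770,Total Bags,Small Bags,Large Bags,XLarge Bags,type,year,Region
0,2015-01-04,1.10,1000.0,300.0,400.0,50.0,250.0,200.0,45.0,5.0,conventional,2015,Albany
1,2017-03-05,1.80,500.0,100.0,150.0,20.0,230.0,180.0,50.0,0.0,organic,2017,Boston
"""

rows = list(read_avocado_rows(io.StringIO(SAMPLE)))
print(rows[0]["Date"], rows[0]["AveragePrice"], rows[0]["Region"])
```

For the real file, the same generator could be fed a handle from open("Augmented_avocado.csv", newline=""), assuming the file uses a comma delimiter and the header names listed above.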

Distribution

This product is a single CSV file, Augmented_avocado.csv, of 237.46 MB. It contains 1,000,000 records, all reported as valid. Observations are weekly, reflect national retail volume, and span January 2015 to March 2018. All records relate exclusively to Hass avocados.

Usage

This data product suits analysis of agricultural commodity pricing and consumer behaviour. Potential applications include:
  • Regional Economic Analysis: Identifying markets where prices are lowest to explore regional cost of living insights.
  • Price Volatility Studies: Analysing historical price fluctuations, for example testing whether the "Avocadopocalypse of 2017" is visible in the data.
  • Demand Forecasting: Utilising historical sales volume metrics (Total Volume, bag sizes, PLU categories) for predictive modelling.
  • Data Augmentation Practice: Leveraging the CTGAN augmented data for training machine learning models.
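
As a rough illustration of the first two applications above, the sketch below ranks regions by mean AveragePrice and summarises price level and spread per year. It is one possible approach, not a method prescribed by the listing: the function names are hypothetical, the population standard deviation is used here as a simple stand-in for volatility, and the records are made-up values for illustration only.

```python
from collections import defaultdict
from statistics import mean, pstdev

# Made-up records for illustration only; real rows would come from Augmented_avocado.csv.
records = [
    {"Region": "Albany", "year": 2016, "AveragePrice": 1.20},
    {"Region": "Albany", "year": 2017, "AveragePrice": 1.60},
    {"Region": "Houston", "year": 2016, "AveragePrice": 0.90},
    {"Region": "Houston", "year": 2017, "AveragePrice": 1.10},
    {"Region": "Boston", "year": 2016, "AveragePrice": 1.40},
    {"Region": "Boston", "year": 2017, "AveragePrice": 2.00},
]


def group_prices(rows, key):
    """Collect AveragePrice values under each distinct value of `key`."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row["AveragePrice"])
    return groups


def cheapest_regions(rows, top_n=3):
    """Rank regions by unweighted mean AveragePrice, lowest first."""
    means = {
        region: mean(prices)
        for region, prices in group_prices(rows, "Region").items()
    }
    return sorted(means.items(), key=lambda item: item[1])[:top_n]


def yearly_price_summary(rows):
    """Return {year: (mean price, population std dev)} as a rough volatility measure."""
    return {
        year: (mean(prices), pstdev(prices))
        for year, prices in sorted(group_prices(rows, "year").items())
    }


for region, price in cheapest_regions(records):
    print(f"{region}: {price:.2f}")
for year, (avg, spread) in yearly_price_summary(records).items():
    print(f"{year}: mean={avg:.2f} spread={spread:.2f}")
```

The means here treat every row equally. Weighting each price by Total Volume would be a reasonable alternative, and because the file contains CTGAN-generated rows, any pattern found this way is worth checking against the original scan data.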

Coverage

The data spans from the beginning of 2015 through the first quarter of 2018. Geographic coverage includes multiple US markets, featuring 54 unique cities or regions. The retail scan data aggregates sales across multiple channels, including grocery, mass, club, drug, dollar, and military outlets. It distinguishes between conventional and organic avocado types.
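
The coverage figures above can be checked against a loaded file. The sketch below is a hypothetical sanity check, assuming rows have already been parsed into dictionaries with a datetime.date under Date; the function names and sample rows are illustrative and not part of the dataset.

```python
from datetime import date

EXPECTED_TYPES = {"conventional", "organic"}
EXPECTED_REGION_COUNT = 54  # figure stated in the listing
EXPECTED_START = date(2015, 1, 1)
EXPECTED_END = date(2018, 3, 31)


def summarize_coverage(rows):
    """Collect the date span, distinct regions and distinct types from an iterable of rows."""
    first = last = None
    regions, types = set(), set()
    for row in rows:
        day = row["Date"]
        first = day if first is None or day < first else first
        last = day if last is None or day > last else last
        regions.add(row["Region"])
        types.add(row["type"])
    return {"first": first, "last": last, "regions": regions, "types": types}


def coverage_issues(summary):
    """Return human-readable mismatches between the data and the stated coverage."""
    if summary["first"] is None:
        return ["no rows found"]
    issues = []
    if summary["first"] < EXPECTED_START or summary["last"] > EXPECTED_END:
        issues.append("dates fall outside January 2015 to March 2018")
    if len(summary["regions"]) != EXPECTED_REGION_COUNT:
        issues.append(
            f"found {len(summary['regions'])} regions, expected {EXPECTED_REGION_COUNT}"
        )
    unexpected = summary["types"] - EXPECTED_TYPES
    if unexpected:
        issues.append(f"unexpected type values: {sorted(unexpected)}")
    return issues


# Made-up rows for illustration only; two regions will not match the stated 54.
sample_rows = [
    {"Date": date(2015, 1, 4), "Region": "Albany", "type": "conventional"},
    {"Date": date(2018, 3, 25), "Region": "Boston", "type": "organic"},
]
print(coverage_issues(summarize_coverage(sample_rows)))
```

Because summarize_coverage reads rows one at a time, it can be run over the full one-million-row file without loading it into memory.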

License

CC0: Public Domain

Who Can Use It

  • Market Researchers: To analyse regional supply, demand, and price variations.
  • Data Scientists: To build predictive models for commodity pricing and retail volume based on augmented data.
  • Economists: To study market efficiency and consumer sensitivity to price changes over time.
  • Business Intelligence Professionals: To benchmark retail performance across different regions and sales channels.

Dataset Name Suggestions

  • US Retail Hass Avocado Market Data (2015-2018)
  • Avocado Price and Sales Volume Augmentation
  • US Regional Avocado Pricing and Consumption

Attributes

Listing Stats

Views: 0
Downloads: 0
Listed: 08/11/2025
Region: Global
Universal Data Quality Score (UDQS): 5 / 5
Version: 1.0
