Opendatabay APP

Global Wine Ratings Dataset

Product Reviews & Feedback

Tags and Keywords

Wine

Reviews

Alcohol

Ratings

Tasting

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Global Wine Ratings Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset offers a collection of approximately 130,000 wine reviews originally published by WineEnthusiast. It provides rich detail for each wine, including the taster's name, wine price, variety, a quality score, country of origin, specific county or province, and the vineyard. This dataset is invaluable for understanding trends in wine characteristics and market dynamics. Potential analytical explorations include investigating provincial average prices, correlations between review scores and price, and delving into reviewer specialisations or common terms used in reviews.

Columns

  • id: A unique identifier assigned to each wine review.
  • country: The country where the wine was produced. The dataset includes wines from various countries, with the United States and France being prominent.
  • description: A textual review detailing the wine's characteristics and flavour profile. This column contains a vast number of unique descriptions.
  • designation: Indicates the specific vineyard or property designation of the wine. Some entries in this column may be unavailable.
  • points: The score awarded to the wine by the reviewer, typically on a scale from 80 to 100. The average score is around 88.4 points.
  • price: The retail price of the wine in USD. Prices vary widely, with an average of approximately £35.40, though some entries are not available.
  • province: The province or state of origin for the wine. California is a frequently occurring province.
  • region_1: The primary wine region within the province or country. Napa Valley is a notable region within this column. Some entries may be missing.
  • region_2: A more specific, secondary wine region. This column has a considerable number of unavailable entries, with Central Coast appearing frequently when present.
  • taster_name: The name of the wine reviewer who provided the assessment. Roger Voss is a frequent contributor. Some names are not recorded.
  • taster_twitter_handle: The Twitter handle of the wine reviewer. Many entries are not available for this column.
  • title: The full title of the wine review, which often includes the wine name, vintage, and origin.
  • variety: The grape variety or blend used to produce the wine. Pinot Noir and Chardonnay are among the most common varieties.
  • winery: The name of the winery that produced the wine. There are many distinct wineries represented in the dataset.

Distribution

The dataset is provided as a CSV file, specifically named winemag-data-130k-v2.csv. It has a file size of 52.8 MB and comprises approximately 130,000 individual wine reviews. Each review is structured across 14 distinct columns.

Usage

This dataset is ideal for:
  • Market Analysis: Identifying regional pricing differences and market trends in the wine industry.
  • Predictive Modelling: Building models to predict wine prices based on review scores or other attributes.
  • Text Analysis: Extracting common terms associated with positive or negative wine reviews to understand descriptive language.
  • Reviewer Profiling: Analysing reviewer specialisations, such as their preferred regions or wine varieties.
  • Geospatial Analysis: Mapping wine origins and exploring their geographic distribution and characteristics.

Coverage

The dataset's geographic scope is broad, covering wines from numerous countries, provinces, and specific regions worldwide, with a notable presence from the United States (especially California) and France. Information on individual tasters, including their names and Twitter handles, is available, though some data points for these fields are not present. The dataset represents a historical collection of reviews and is not expected to receive future updates, meaning it provides a static snapshot of wine reviews from WineEnthusiast.

License

CC0: Public Domain

Who Can Use It

  • Data Scientists and Analysts: For conducting statistical analysis, building machine learning models, and performing advanced text mining on review descriptions.
  • Wine Industry Professionals: Such as marketers, distributors, and producers, for market research, competitive analysis, and identifying consumer preferences.
  • Academic Researchers: For studies in consumer behaviour, linguistics, and economic patterns within the beverage industry.
  • Wine Enthusiasts and Bloggers: To explore wine characteristics, compare reviews, and gain deeper insights into their favourite vintages and regions.

Dataset Name Suggestions

  • WineEnthusiast Review Data
  • Global Wine Ratings Dataset
  • Vintage Wine Reviews
  • Wine Marketplace Data

Attributes

Original Data Source: Global Wine Ratings Dataset

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

13/08/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in CSV Format