Opendatabay APP

Global Beer Characteristics Dataset

Retail & Consumer Behavior

Tags and Keywords

Retail

NLP

Clustering

Alcohol

Recommender

Free

About

This dataset provides a detailed collection of tasting profiles and consumer reviews for 3197 unique beers from 934 different breweries. It was created by integrating information from two existing datasets: "Beer Tasting Profiles Dataset" and "1.5 Million Beer Reviews". The primary purpose is to offer a unified resource containing consumer review scores for aroma, appearance, palate, taste, and overall quality, alongside detailed tasting profiles for various brews. This consolidated data allows for deeper analysis of beer characteristics and consumer preferences.

Columns

  • Name: The name or label of the beer.
  • Style: The style of beer.
  • Brewery: The name of the brewery.
  • Beer Name (Full): The complete beer name, combining Brewery and Brew Name, serving as a unique identifier for each beer.
  • Description: Any available notes on the beer.
  • ABV: The alcohol content of the beer, expressed as a percentage by volume.
  • Min IBU: The minimum International Bitterness Unit value a beer of its style can possess.
  • Max IBU: The maximum International Bitterness Unit value a beer of its style can possess.
  • Astringency: A tasting profile feature describing mouthfeel, calculated from word counts in reviews.
  • Body: A tasting profile feature describing mouthfeel, calculated from word counts in reviews.
  • Alcohol: A tasting profile feature describing mouthfeel, calculated from word counts in reviews.
  • Bitter: A tasting profile feature describing taste, calculated from word counts in reviews.
  • Sweet: A tasting profile feature describing taste, calculated from word counts in reviews.
  • Sour: A tasting profile feature describing taste, calculated from word counts in reviews.
  • Salty: A tasting profile feature describing taste, calculated from word counts in reviews.
  • Fruits: A tasting profile feature describing flavour and aroma, calculated from word counts in reviews.
  • Hoppy: A tasting profile feature describing flavour and aroma, calculated from word counts in reviews.
  • Spices: A tasting profile feature describing flavour and aroma, calculated from word counts in reviews.
  • Malty: A tasting profile feature describing flavour and aroma, calculated from word counts in reviews.
  • review_aroma: The average rating score for the beer's aroma from consumer reviews.
  • review_appearance: The average rating score for the beer's appearance from consumer reviews.
  • review_palate: The average rating score for the beer's palate from consumer reviews.
  • review_taste: The average rating score for the beer's taste from consumer reviews.
  • review_overall: The average overall rating score from consumer reviews.
  • number_of_reviews: The total count of consumer reviews for the beer.

The tasting profile features (Astringency through Malty) are word counts of terms from a predefined list of beer descriptors, tallied across up to 25 reviews for each beer.
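
The word-count derivation described above can be sketched roughly as follows. This is an illustration only, not the dataset author's actual method: the `DESCRIPTORS` mapping and the `tasting_profile` function are hypothetical, and the real predefined descriptor list is not reproduced in this listing.

```python
import re

# Hypothetical descriptor vocabulary (an assumption for illustration);
# the dataset's actual predefined list is not given in this listing.
DESCRIPTORS = {
    "Bitter": {"bitter", "bitterness"},
    "Sweet": {"sweet", "sugary", "caramel"},
    "Hoppy": {"hoppy", "hops", "piney"},
}

MAX_REVIEWS = 25  # the listing states that up to 25 reviews per beer are used


def tasting_profile(reviews):
    """Count descriptor words per feature across the first MAX_REVIEWS reviews."""
    counts = {feature: 0 for feature in DESCRIPTORS}
    for text in reviews[:MAX_REVIEWS]:
        # Lowercase the review and split it into alphabetic words.
        words = re.findall(r"[a-z]+", text.lower())
        for feature, vocabulary in DESCRIPTORS.items():
            counts[feature] += sum(1 for word in words if word in vocabulary)
    return counts
```

Because the features are raw counts, beers with longer or more numerous reviews may show larger values, so some form of scaling is probably worth considering before comparing beers.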

Distribution

The primary dataset is provided in a CSV file named beer_profile_and_ratings.csv. Additional files, Brewery Name Fuzzy Match List.csv and Beer Name Fuzzy Match List.csv, list breweries and beers included from the source datasets. The dataset contains 3197 unique beers and 934 different breweries. It holds a quality rating of 5 out of 5 and is version 1.0.
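
A minimal sketch of reading the main file with Python's standard `csv` module is shown below. It assumes the CSV header uses exactly the column names listed under Columns, which this listing does not confirm; `load_beers` and `summarize` are hypothetical helper names.

```python
import csv

# Assumed to match the CSV header exactly; adjust if the file differs.
NUMERIC_COLUMNS = ["ABV", "Min IBU", "Max IBU", "review_overall", "number_of_reviews"]


def load_beers(lines):
    """Parse CSV lines (for example an open file) into a list of dicts.

    Values in NUMERIC_COLUMNS are converted to float; empty cells are left as-is.
    """
    rows = []
    for row in csv.DictReader(lines):
        for column in NUMERIC_COLUMNS:
            if row.get(column) not in (None, ""):
                row[column] = float(row[column])
        rows.append(row)
    return rows


def summarize(rows):
    """Return (number of unique beers, number of unique breweries)."""
    beers = {row["Beer Name (Full)"] for row in rows}
    breweries = {row["Brewery"] for row in rows}
    return len(beers), len(breweries)
```

Passing a file opened with `open("beer_profile_and_ratings.csv", newline="", encoding="utf-8")` to `load_beers` and then calling `summarize` gives a quick check against the 3197 beers and 934 breweries stated above. The UTF-8 encoding is an assumption.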

Usage

This dataset is suitable for a variety of analytical and machine learning applications, including:
  • Analysing which properties are associated with highly rated beers.
  • Clustering and building a beer recommendation system based on similarities.
  • Classifying different beer styles based on tasting profile information.
  • Predicting a brew's alcohol content (ABV) using known characteristics.
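
As one illustration of the recommendation use case above, the sketch below ranks beers by cosine similarity of their tasting profile columns. It is a simple baseline under stated assumptions, not a method prescribed by the dataset: `recommend` and `cosine_similarity` are hypothetical helpers, and each beer is assumed to be a dict keyed by the column names listed under Columns.

```python
import math

# Tasting profile columns as named in the Columns section.
PROFILE_COLUMNS = [
    "Astringency", "Body", "Alcohol", "Bitter", "Sweet", "Sour",
    "Salty", "Fruits", "Hoppy", "Spices", "Malty",
]


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length numeric vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat an all-zero profile as similar to nothing
    return dot / (norm_a * norm_b)


def recommend(beers, target_name, top_n=3):
    """Return the names of the top_n beers most similar to target_name."""
    vectors = {
        beer["Beer Name (Full)"]: [float(beer[col]) for col in PROFILE_COLUMNS]
        for beer in beers
    }
    target = vectors[target_name]
    scored = [
        (cosine_similarity(target, vector), name)
        for name, vector in vectors.items()
        if name != target_name
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [name for _, name in scored[:top_n]]
```

Cosine similarity compares the shape of a profile rather than its magnitude, which may suit raw word counts that grow with review length. Other distance measures or a clustering step could be substituted.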

Coverage

The dataset covers 3197 unique beers from 934 breweries and is global in scope. No time range or demographic information is provided.

License

CC-BY

Who Can Use It

  • Data scientists and machine learning engineers for developing prediction models, clustering algorithms, and recommendation systems.
  • Researchers interested in consumer behaviour, product characteristics, and sensory analysis within the beverage industry.
  • Beer enthusiasts and connoisseurs looking for detailed insights into beer tasting profiles and ratings.
  • Developers creating applications related to beer discovery or review.

Dataset Name Suggestions

  • Beer Tasting Profiles and Ratings
  • Brewery and Beer Review Data
  • Global Beer Characteristics Dataset
  • Consumer Beer Insights

Attributes

Original Data Source: Beer Profile and Ratings Data Set

Listing Stats

VIEWS

1

DOWNLOADS

0

LISTED

16/06/2025

REGION

GLOBAL

UDQS QUALITY

5 / 5

VERSION

1.0
