Beer Advocate Review Dataset
Product Reviews & Feedback
Free
About
This dataset is a large collection of beer reviews from Beer Advocate, well suited to anyone who wants to practise data skills on real-world data. It contains more than 1.5 million reviews. Each review rates a beer in five categories: appearance, aroma, palate, taste, and overall impression. Each review also carries product information (brewery, beer name, style, and alcohol by volume) and the username of the reviewer. It is a useful resource for exploring how these factors relate to a beer's taste and reception.
Columns
- brewery_name: The name of the brewery that produced the beer. (String)
- review_time: The date and time when the review was submitted. (String)
- review_overall: The reviewer's general rating of the beer, scored from 1 to 5. (Float)
- review_aroma: The reviewer's rating for the beer's aroma, scored from 1 to 5. (Float)
- review_appearance: The reviewer's rating for the beer's visual presentation, scored from 1 to 5. (Float)
- review_profilename: The username of the individual who wrote the review. (String)
- beer_style: The particular style or type of the beer. (String)
- review_palate: The reviewer's rating for the beer's mouthfeel and texture, scored from 1 to 5. (Float)
- review_taste: The reviewer's rating for the beer's flavour, scored from 1 to 5. (Float)
- beer_name: The specific name of the beer. (String)
- beer_abv: The alcohol by volume percentage of the beer. (Float)
- brewery_id: A unique identifier for the brewery. (Float)
- beer_beerid: A unique identifier for the specific beer. (Float)
- index: An internal record index. (Float)
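As a minimal sketch, a subset of the columns listed above could be read into typed records with Python's standard library. The `Review` class, the `parse_float` helper, and the two sample rows are illustrative assumptions and not part of the dataset. Empty cells are mapped to `None` rather than to a guessed value.

```python
import csv
import io
from dataclasses import dataclass
from typing import Optional


def parse_float(cell: str) -> Optional[float]:
    """Return the cell as a float, or None when the cell is empty."""
    cell = cell.strip()
    return float(cell) if cell else None


@dataclass
class Review:
    """Hypothetical record holding a subset of the dataset's columns."""
    brewery_name: str
    beer_name: str
    beer_style: str
    beer_abv: Optional[float]
    review_overall: Optional[float]
    review_profilename: str


def read_reviews(handle) -> list[Review]:
    """Parse CSV rows (with a header line) into Review records."""
    return [
        Review(
            brewery_name=row["brewery_name"],
            beer_name=row["beer_name"],
            beer_style=row["beer_style"],
            beer_abv=parse_float(row["beer_abv"]),
            review_overall=parse_float(row["review_overall"]),
            review_profilename=row["review_profilename"],
        )
        for row in csv.DictReader(handle)
    ]


# Made-up sample rows standing in for beer_reviews.csv.
SAMPLE = """brewery_name,beer_name,beer_style,beer_abv,review_overall,review_profilename
Example Brewery,Example Stout,Stout,6.5,4.0,reviewer_a
Example Brewery,Example Lager,Lager,,3.5,reviewer_b
"""

reviews = read_reviews(io.StringIO(SAMPLE))

# Count records whose beer_abv cell was empty.
missing_abv = sum(1 for r in reviews if r.beer_abv is None)
```

To run this against the real file, pass an open file handle instead of the `io.StringIO` sample, for example `open("beer_reviews.csv", newline="", encoding="utf-8")`. The UTF-8 encoding is an assumption that the listing does not confirm.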
Distribution
The dataset is provided as a single CSV file, beer_reviews.csv, approximately 201.64 MB in size. It contains more than 1.5 million beer reviews across 14 columns, made up of string and float types. Most columns are complete or nearly so, with about 1.59 million valid entries each. The exceptions are beer_abv, with around 96% valid entries (1.52 million), review_profilename, with 348 missing values, and brewery_name, with 15 missing entries.
Usage
This dataset is suited for a variety of applications, including:
- Developing and honing data analysis skills.
- Building predictive models to forecast how well new beers might be rated.
- Creating beer recommender systems to suggest beers based on preferences.
- Designing web applications that assist users in discovering local breweries according to their taste.
- Analysing and comparing different beers based on their ratings.
- Investigating which aspects (appearance, aroma, palate, taste, overall) are most influential for reviewers.
- Filtering and analysing beers by style or alcohol content.
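One way to approach the question of which aspects matter most to reviewers is to correlate each aspect rating with the overall rating. The sketch below computes a Pearson correlation by hand on a few made-up rows. The `pearson` helper and the sample values are illustrative assumptions, and correlation is only one of several reasonable measures of influence.

```python
import math


def pearson(xs: list[float], ys: list[float]) -> float:
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Made-up ratings; real values would come from beer_reviews.csv.
rows = [
    {"review_taste": 4.5, "review_appearance": 4.0, "review_overall": 4.5},
    {"review_taste": 3.0, "review_appearance": 4.0, "review_overall": 3.0},
    {"review_taste": 4.0, "review_appearance": 3.5, "review_overall": 4.0},
    {"review_taste": 2.0, "review_appearance": 3.0, "review_overall": 2.5},
]

overall = [r["review_overall"] for r in rows]

# Correlate each aspect column with the overall rating.
correlations = {
    aspect: pearson([r[aspect] for r in rows], overall)
    for aspect in ("review_taste", "review_appearance")
}

# The aspect whose ratings track the overall rating most closely.
strongest = max(correlations, key=correlations.get)
```

On the full dataset the same loop would cover review_aroma and review_palate as well. Rows with missing ratings would need to be skipped first, and a column with no variation would make the denominator zero.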
Coverage
The dataset covers reviews from approximately 1996 to 2012, based on the review_time timestamps. All reviews come from Beer Advocate. The source gives no geographic or demographic coverage details beyond the platform name.
License
Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Who Can Use It
- Data enthusiasts and students looking to practice data skills with a real-world, engaging dataset.
- Researchers and analysts interested in consumer preferences within the beer industry.
- Developers aiming to build applications like beer recommenders or brewery locators.
- Anyone curious about beer, its characteristics, and how it is reviewed.
- Users should note that the license permits non-commercial use only.
Dataset Name Suggestions
- Beer Advocate Review Dataset
- Craft Beer Ratings & Reviews
- Brewery & Beer Review Data
- Beer Taste Profile Records
- Global Beer Review Archive
Attributes
Original Data Source: Beer Advocate Review Dataset