Opendatabay APP

Amazon Customer Review Dataset

Reviews & Ratings

Tags and Keywords

Computer

Text

Nlp

E-commerce

Languages

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Amazon Customer Review Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset is a collection of customer reviews obtained from Amazon.com. It is designed for multilingual sentiment analysis and opinion mining, containing reviews in five different languages: Italian, German, Spanish, French, and English. The dataset is valuable for natural language processing tasks, sentiment analysis algorithms, and various machine learning applications that require diverse language data for training and evaluation. It can be used to train and fine-tune models to automatically classify sentiments, predict customer satisfaction, and extract key information from customer reviews.

Columns

  • user_name: The name of the reviewer.
  • stars: The number of stars awarded in the review.
  • country: The country of the reviewer.
  • date: The date when the review was posted.
  • title: The title of the review.
  • text: The main body of the review text.
  • helpful: The count of people who found the review useful.

Distribution

The dataset is typically provided in a CSV file format. While specific total row counts are not available, examples of column value distributions are present, such as 675 total values for user names and 640 total values for star ratings, with 92% being 5/5 reviews. The dataset is structured to support various text and NLP applications.

Usage

This dataset is ideal for a range of applications, including:
  • Multilingual sentiment analysis.
  • Opinion mining studies.
  • Developing and testing natural language processing tasks.
  • Building sentiment analysis algorithms.
  • Training machine learning models to classify sentiments.
  • Predicting customer satisfaction from review data.
  • Extracting key insights and information from customer feedback.

Coverage

The dataset's coverage is global, drawing reviews from Amazon.com. It includes content in Italian, German, Spanish, French, and English, indicating its relevance to regions where these languages are spoken. The dataset contains a 'date' column for each review; however, a specific time range for the reviews themselves is not provided.

License

CC-BY-NC

Who Can Use It

This dataset is suitable for:
  • Data Scientists and Researchers: For developing and testing machine learning models for sentiment analysis, NLP, and text classification across multiple languages.
  • E-commerce Analysts: To understand customer satisfaction, product performance, and market sentiment from user reviews.
  • Language Model Developers: To fine-tune large language models with diverse text data for improved natural language understanding.
  • Businesses: To gain insights into customer feedback and improve product or service offerings.

Dataset Name Suggestions

  • Amazon Customer Review Data
  • Multilingual Amazon Product Reviews
  • E-commerce Customer Sentiment Data
  • Global Amazon Review Collection

Attributes

Original Data Source: Amazon Review Dataset LLM

Listing Stats

VIEWS

2

DOWNLOADS

0

LISTED

08/06/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free