Opendatabay APP

Emotion Annotated Indonesian Reviews

Reviews & Ratings

Tags and Keywords

Online

Nlp

Text

Neural

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Emotion Annotated Indonesian Reviews Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset is a collection of Indonesian product review data, meticulously annotated with emotion and sentiment labels. It was gathered from Tokopedia, a prominent e-commerce platform in Indonesia, encompassing product reviews from 29 distinct product categories. Each review is assigned a single emotion label, such as love, happiness, anger, fear, or sadness. The emotion annotation process was conducted by a group of annotators who followed specific criteria established by an expert in clinical psychology. The dataset also includes other valuable attributes related to the product reviews, including location, price, overall rating, number sold, total reviews, and customer rating, designed to facilitate further research. The data is considered clean.

Columns

While a specific original data sample is not available to list all columns in detail, based on the dataset description, the following attributes are included:
  • Product Review Text: The original review content.
  • Emotion Label: Categorical label indicating the primary emotion (e.g., love, happiness, anger, fear, sadness).
  • Sentiment Label: Overall sentiment associated with the review.
  • Location: Geographic information related to the review or product.
  • Price: The price of the product reviewed.
  • Overall Rating: The product's general rating.
  • Number Sold: The quantity of the product sold.
  • Total Review: The total number of reviews for the product.
  • Customer Rating: The rating provided by the customer for the specific product.

Distribution

The dataset is typically provided in a CSV file format. It contains product reviews from 29 different product categories. Specific figures for the total number of rows or records are not detailed in the provided information.

Usage

This dataset is ideally suited for various applications and research endeavours, including:
  • Learning: Excellent for educational purposes in data science, natural language processing, and text analytics.
  • Research: Supports in-depth studies in natural language processing (NLP), text processing, consumer emotion analysis, text mining, and sentiment analysis.
  • Model Training: Can be used for training machine learning models, including large language models (LLMs), for tasks such as emotion classification, sentiment analysis, and text understanding in Indonesian.
  • Application Development: Useful for developing applications that require understanding consumer feedback and emotions from product reviews.

Coverage

The dataset's geographic scope is focused on Indonesia, specifically product reviews from an Indonesian e-commerce platform, Tokopedia, written in the Indonesian language. The listed date for the dataset on the platform is 08/06/2025; however, the actual time range during which the data was collected for the reviews themselves is not specified in the sources. There are no specific notes on data availability for certain demographic groups or years beyond general product review consumers in Indonesia.

License

CCO

Who Can Use It

This dataset is beneficial for a wide range of users, including:
  • Academics and Researchers: For exploring topics in NLP, sentiment analysis, and consumer behaviour.
  • Students: As a practical resource for learning about text data processing, emotion classification, and data analysis.
  • Data Scientists and Machine Learning Engineers: For building and fine-tuning models capable of understanding and classifying emotions and sentiments from textual data.
  • Businesses: Potentially for market research and understanding customer feedback trends, particularly within the Indonesian e-commerce sector.

Dataset Name Suggestions

  • Indonesian Product Review Emotions
  • Tokopedia Emotion & Sentiment Dataset
  • Indonesian E-commerce Review Sentiment
  • PRDECT-ID: Indonesian Consumer Emotion Data
  • Emotion Annotated Indonesian Reviews

Attributes

Listing Stats

VIEWS

1

DOWNLOADS

0

LISTED

08/06/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in CSV Format