Opendatabay APP

Lazada Product Sentiment Data

Product Reviews & Feedback

Tags and Keywords

Reviews

Indonesia

Lazada

Product

Sentiment

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Lazada Product Sentiment Data Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset contains product reviews from Lazada Indonesia, organised by various product categories. Its primary purpose is to provide a collection of Indonesian product reviews, particularly useful for sentiment analysis studies. The data was collected using Puppeteer and initially categorised, though merged CSV files became the standard from October 2nd, 2019. It offers insights into consumer opinions and product performance within the Indonesian e-commerce landscape.

Columns

  • itemId: A unique identifier for each item.
  • category: The category to which the item belongs, such as 'beli-harddisk-eksternal' or 'jual-flash-drives'.
  • name: The name of the item.
  • brandName: The brand of the item, for instance, SanDisk or Asus.
  • url: The direct URL to the item's page on Lazada.
  • price: The price of the item.
  • averageRating: The average rating given to the item. Ratings range from 1 to 5.
  • totalReviews: The total number of reviews an item has received.
  • retrievedDate: The date when the data for the item was retrieved, typically 2019-10-02.

Distribution

The dataset is provided in CSV file format, with yyyymmdd-items.csv containing item entries and yyyymmdd-reviews.csv containing corresponding reviews. A categories.txt file lists the available categories. From October 2nd, 2019, published datasets were merged into single CSV files. The 20191002-items.csv file, for example, is approximately 3.02 MB and contains over 10,900 valid records. Specific numbers for rows or records in the reviews file are not detailed, but it contains reviews for items listed in the items file.

Usage

This dataset is ideal for:
  • Sentiment analysis research on Indonesian product reviews.
  • Market research to understand consumer preferences and pain points in the Indonesian e-commerce sector.
  • Developing and testing natural language processing (NLP) models for the Indonesian language.
  • Analysing product performance and customer satisfaction based on ratings and reviews.
  • Identifying popular product categories and brands on Lazada Indonesia.

Coverage

The dataset focuses on Lazada Indonesia, providing a geographical scope limited to Indonesia. The data was primarily retrieved on October 2nd, 2019. There are no specific notes on data availability for particular demographic groups, as it reflects general consumer reviews on the platform.

License

CC0: Public Domain

Who Can Use It

  • Academic Researchers: For theses on sentiment analysis, NLP, or e-commerce studies.
  • Data Scientists and Analysts: To build predictive models or derive business intelligence from product review data.
  • Businesses and Marketers: To gain insights into consumer behaviour and product perception in the Indonesian market.
  • Developers: Interested in creating applications that process or leverage product review data.

Dataset Name Suggestions

  • Lazada Indonesia Review Dataset
  • Indonesian E-commerce Product Reviews
  • Lazada Product Sentiment Data
  • ID Consumer Reviews
  • Indonesia Online Shopping Reviews

Attributes

Original Data Source: Lazada Product Sentiment Data

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

25/07/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in ZIP Format