Lazada Product Sentiment Data
Product Reviews & Feedback
Free
About
This dataset contains product reviews from Lazada Indonesia, organised by product category. Its primary purpose is to provide a collection of Indonesian-language product reviews for sentiment analysis studies. The data was collected using Puppeteer. It was initially split by category, and from October 2nd, 2019, the published data is merged into single CSV files. It offers insight into consumer opinions and product performance within the Indonesian e-commerce landscape.
Columns
- itemId: A unique identifier for each item.
- category: The category to which the item belongs, such as 'beli-harddisk-eksternal' or 'jual-flash-drives'.
- name: The name of the item.
- brandName: The brand of the item, for instance, SanDisk or Asus.
- url: The direct URL to the item's page on Lazada.
- price: The price of the item.
- averageRating: The average rating given to the item. Ratings range from 1 to 5.
- totalReviews: The total number of reviews an item has received.
- retrievedDate: The date when the data for the item was retrieved, typically 2019-10-02.
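As a rough illustration of the columns above, the sketch below parses one item row into a typed record using only the Python standard library. It assumes that the CSV header uses the column names exactly as listed, that price, averageRating and totalReviews are plain numbers, and that retrievedDate is in ISO format (YYYY-MM-DD). The `Item` class, the `parse_item` helper and the sample row are hypothetical and are not part of the dataset.

```python
import csv
import io
from dataclasses import dataclass
from datetime import date


@dataclass
class Item:
    item_id: str
    category: str
    name: str
    brand_name: str
    url: str
    price: float
    average_rating: float
    total_reviews: int
    retrieved_date: date


def parse_item(row: dict) -> Item:
    """Convert one CSV row (all values are strings) into a typed Item."""
    return Item(
        item_id=row["itemId"],
        category=row["category"],
        name=row["name"],
        brand_name=row["brandName"],
        url=row["url"],
        price=float(row["price"]),
        average_rating=float(row["averageRating"]),
        total_reviews=int(row["totalReviews"]),
        retrieved_date=date.fromisoformat(row["retrievedDate"]),
    )


# Hypothetical sample row; the values are invented for illustration only.
SAMPLE_CSV = (
    "itemId,category,name,brandName,url,price,averageRating,totalReviews,retrievedDate\n"
    "1001,jual-flash-drives,Example Flash Drive 32GB,SanDisk,"
    "https://example.com/item/1001,75000,4.5,12,2019-10-02\n"
)

items = [parse_item(row) for row in csv.DictReader(io.StringIO(SAMPLE_CSV))]
print(items[0].brand_name, items[0].average_rating, items[0].retrieved_date)
```

To read the real file, replace the `io.StringIO(SAMPLE_CSV)` source with an opened file such as 20191002-items.csv. Rows with missing or non-numeric values would need extra handling that this sketch leaves out.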
Distribution
The dataset is provided in CSV format, with yyyymmdd-items.csv containing item entries and yyyymmdd-reviews.csv containing the corresponding reviews. A categories.txt file lists the available categories. From October 2nd, 2019, published datasets were merged into single CSV files. The 20191002-items.csv file, for example, is approximately 3.02 MB and contains over 10,900 valid records. The number of rows in the reviews file is not stated, but it contains reviews for the items listed in the items file.
Usage
This dataset is ideal for:
- Sentiment analysis research on Indonesian product reviews.
- Market research to understand consumer preferences and pain points in the Indonesian e-commerce sector.
- Developing and testing natural language processing (NLP) models for the Indonesian language.
- Analysing product performance and customer satisfaction based on ratings and reviews.
- Identifying popular product categories and brands on Lazada Indonesia.
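The last two use cases in the list above (product performance, and popular categories and brands) can be approximated from the items file alone. The sketch below is one possible approach, not a prescribed method: it groups items by a chosen column and reports the total review count together with a review-weighted average rating. The `summarise` helper and the three sample rows are hypothetical, and the code assumes the numeric columns are always populated.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample rows; the values are invented for illustration only.
SAMPLE_CSV = """itemId,category,name,brandName,url,price,averageRating,totalReviews,retrievedDate
1,jual-flash-drives,Drive A,SanDisk,https://example.com/1,75000,4.0,10,2019-10-02
2,jual-flash-drives,Drive B,SanDisk,https://example.com/2,90000,5.0,30,2019-10-02
3,beli-harddisk-eksternal,Disk C,Asus,https://example.com/3,800000,3.0,20,2019-10-02
"""


def summarise(rows, key):
    """Return {group: (total_reviews, review_weighted_avg_rating)} for a column."""
    totals = defaultdict(int)
    weighted = defaultdict(float)
    for row in rows:
        n = int(row["totalReviews"])
        totals[row[key]] += n
        weighted[row[key]] += float(row["averageRating"]) * n
    return {
        # Groups with no reviews at all get None instead of a rating.
        group: (totals[group], weighted[group] / totals[group] if totals[group] else None)
        for group in totals
    }


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
by_brand = summarise(rows, "brandName")
by_category = summarise(rows, "category")

# Rank brands by review volume, most reviewed first.
for brand, (n_reviews, rating) in sorted(by_brand.items(), key=lambda kv: -kv[1][0]):
    print(brand, n_reviews, rating)
```

Weighting by totalReviews keeps an item with a handful of reviews from counting as much as one with hundreds. An unweighted mean of averageRating is a simpler alternative when every item should count equally.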
Coverage
The dataset covers Lazada Indonesia only, so its geographical scope is limited to Indonesia. The data was primarily retrieved on October 2nd, 2019. No demographic information is provided, as the data reflects general consumer reviews on the platform.
License
CC0: Public Domain
Who Can Use It
- Academic Researchers: For theses on sentiment analysis, NLP, or e-commerce studies.
- Data Scientists and Analysts: To build predictive models or derive business intelligence from product review data.
- Businesses and Marketers: To gain insights into consumer behaviour and product perception in the Indonesian market.
- Developers: To create applications that process or leverage product review data.
Dataset Name Suggestions
- Lazada Indonesia Review Dataset
- Indonesian E-commerce Product Reviews
- Lazada Product Sentiment Data
- ID Consumer Reviews
- Indonesia Online Shopping Reviews
Attributes
Original Data Source: Lazada Product Sentiment Data