Flipkart E-commerce Products
E-commerce & Online Transactions
Free
About
This dataset features 20,000 product listings from Flipkart.com, a prominent Indian e-commerce platform. It is a pre-crawled subset of a much larger collection of over 5.8 million products. Each record captures product details, prices, and brand information, which makes the dataset useful for analysing product pricing, specifications, and brand presence within a major online retail environment.
Columns
- uniq_id: A unique identification number assigned by the website for each product.
- crawl_timestamp: The precise date and time when the product data was collected.
- product_url: The direct URL link to the product page on Flipkart.com.
- product_name: The name of the product as listed on the platform.
- product_category_tree: A hierarchical classification indicating the product's category, e.g., "Jewellery >> Necklaces & Chains >> Necklaces".
- pid: The Product ID.
- retail_price: The original retail price of the product. This column has some missing values.
- discounted_price: The price of the product after any discounts have been applied. This column also has some missing values.
- image: Links to images of the product. Some values are missing.
- is_FK_Advantage_product: A boolean field indicating whether the product is part of the Flipkart Advantage programme. Approximately 4% of products are part of this programme.
- description: A detailed description of the product, as shown on the product page. A small number of values are missing.
- product_rating: The numerical rating given to the product by users. The majority of products (91%) have no rating available.
- overall_rating: The overall rating for the product. Similar to product_rating, 91% of products have no overall rating available.
- brand: The name of the product's brand. A significant portion (29%) of brand information is missing.
- product_specifications: Detailed specifications for the products, often in a structured format. A few values are missing.
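Since several of the columns above contain missing values, a first step is often to count them per column. The following is a minimal sketch using only the Python standard library. It assumes that a missing value appears as an empty field in the CSV, which this description does not confirm; the helper names load_rows and missing_counts are hypothetical, and the sample rows are made up for illustration.

```python
import csv
import io


def load_rows(path):
    """Read a CSV file into a list of dicts keyed by column name."""
    with open(path, newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


def missing_counts(rows, columns):
    """Count empty or absent values per column (assumes missing == empty field)."""
    counts = {col: 0 for col in columns}
    for row in rows:
        for col in columns:
            value = row.get(col)
            if value is None or value.strip() == "":
                counts[col] += 1
    return counts


# Made-up in-memory sample so the sketch runs without the real file.
sample = io.StringIO(
    "uniq_id,retail_price,brand\n"
    "a1,999,BrandA\n"
    "a2,,\n"
    "a3,499,\n"
)
sample_rows = list(csv.DictReader(sample))
sample_counts = missing_counts(sample_rows, ["uniq_id", "retail_price", "brand"])
print(sample_counts)
```

With the real file, the same function could be applied to load_rows("flipkart_com-ecommerce_sample.csv") and the result compared against the per-column counts reported for the dataset.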
Distribution
The dataset is provided as a CSV file (flipkart_com-ecommerce_sample.csv) and is approximately 38.11 MB in size. It contains 20,000 individual product records, each described by 15 distinct columns. Key identifiers such as uniq_id, product_url, and pid each contain 20,000 unique values, indicating a one-to-one correspondence with the number of products. While most columns are complete, retail_price and discounted_price each have 78 missing entries, image has 3 missing entries, description has 2 missing entries, brand has 5,861 missing entries, and product_specifications has 14 missing entries.
Usage
This dataset is ideal for:
- Pricing analysis: Examining retail and discounted prices to identify trends, popular price points, or discount strategies.
- Product specification analysis: Understanding the characteristics and attributes of various products.
- Brand analysis: Investigating brand presence, popularity, and product offerings on the e-commerce platform.
- E-commerce research: Gaining insights into product categorisation, descriptions, and ratings in an online retail environment.
- Recommendation system development: The product-centric data (categories, descriptions, and specifications) could also support the development of recommendation models.
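The pricing and categorisation analyses listed above can be sketched roughly as follows. This is an illustrative sketch rather than a prescribed method: the function names discount_percent and category_levels are hypothetical, prices are assumed to be stored as plain numeric strings, and the category tree is assumed to be a ">>"-delimited string that may be wrapped in brackets and quotes.

```python
def discount_percent(retail_price, discounted_price):
    """Return the discount as a percentage of the retail price.

    Returns None when either price is missing or not numeric, since both
    price columns are known to contain some missing values.
    """
    try:
        retail = float(retail_price)
        discounted = float(discounted_price)
    except (TypeError, ValueError):
        return None
    if retail <= 0:
        return None
    return round(100.0 * (retail - discounted) / retail, 2)


def category_levels(tree):
    """Split a category tree such as 'A >> B >> C' into its levels.

    Assumption: the value may be wrapped in [ ] and double quotes.
    """
    cleaned = tree.strip().strip("[]").strip('"').strip()
    return [part.strip() for part in cleaned.split(">>") if part.strip()]


print(discount_percent("1000", "750"))
print(discount_percent("", "750"))
print(category_levels('["Jewellery >> Necklaces & Chains >> Necklaces"]'))
```

Returning None for unparseable prices keeps rows with missing values out of any averages instead of silently treating them as zero.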
Coverage
The dataset primarily covers products listed on Flipkart.com, an Indian e-commerce store. The data collection, indicated by the crawl_timestamp column, spans from 1st December 2015 to 28th June 2016. There are no specific notes on demographic scope, as the data is product-centric.
License
CC BY-SA 4.0
Who Can Use It
- Data analysts: To explore pricing patterns and product attributes.
- Market researchers: For competitive analysis of brands and product categories within the Indian e-commerce market.
- E-commerce strategists: To understand product listings, descriptions, and user engagement (ratings).
- Students and academics: For projects and research in e-commerce, data analysis, and web scraping.
Dataset Name Suggestions
- Flipkart E-commerce Products (2015-2016)
- Indian Online Retail Dataset
- Flipkart Product Listings Sample
- E-commerce Product Data (Flipkart)
Attributes
Original Data Source: Flipkart E-commerce Products