Opendatabay APP

Analysis-Ready Pakistan Retail Dataset

Retail & Consumer Behavior

Tags and Keywords

Pakistan

E-commerce

Retail

Business

Sales

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Analysis-Ready Pakistan Retail Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

A cleaned and processed collection of e-commerce orders from Pakistan, spanning from March 2016 to August 2018. This ready-to-analyse dataset offers a detailed look at half a million transactions, including item specifics, customer details, and payment methods. It has been preprocessed to handle missing values and inconsistencies, making it a robust resource for immediate use without requiring preliminary cleaning steps. The data integrity has been enhanced by correcting anomalies and outliers in numerical fields.

Columns

  • item_id: A unique integer identifier for each item.
  • status: The completion status of the order (e.g., complete, cancelled).
  • created_at: The date and time when the order was created.
  • sku: The unique Stock Keeping Unit for the product.
  • price: The price of the item in Pakistani Rupees.
  • qty_ordered: The quantity of the item ordered.
  • grand_total: The total price for the transaction.
  • increment_id: A unique identifier for each order.
  • category_name_1: The primary category of the item.
  • sales_commission_code: The commission code associated with the sale, if applicable.
  • discount_amount: The value of any discount applied to the transaction.
  • payment_method: The method used for payment (e.g., 'cod', 'Payaxis').
  • Working Date: The working date associated with the transaction.
  • BI Status: Business intelligence status, such as 'Net' or 'Gross'.
  • MV: A numerical field related to the transaction value.
  • Year: The year the transaction occurred.
  • Month: The month the transaction occurred.
  • Customer Since: The date the customer first made a purchase.
  • M-Y: The month and year of the transaction.
  • FY: The financial year of the transaction.
  • Customer ID: A unique identifier for each customer.

Distribution

The dataset contains approximately 585,000 records and 21 columns. It is available for download in CSV (89.96 MB) and Pickle formats. The data is fully populated, with no missing values in key columns.

Usage

This dataset is designed to support a wide range of analytical applications. It is particularly useful for exploring e-commerce trends within Pakistan, analysing market dynamics, and studying consumer behaviour patterns. The clean nature of the data facilitates efficient and accurate data manipulation for strategic decision-making in the retail and e-commerce sectors.

Coverage

  • Geographic: The data pertains to e-commerce transactions within Pakistan.
  • Time Range: The dataset covers the period from March 2016 to August 2018.

License

Attribution 4.0 International (CC BY 4.0)

Who Can Use It

  • Data Analysts: For market trend analysis and consumer behaviour studies.
  • Business Strategists: To inform strategic decision-making in the e-commerce sector.
  • Academic Researchers: For studies on emerging e-commerce markets.
  • Marketing Professionals: To understand customer purchasing patterns and popular product categories.

Dataset Name Suggestions

  • Pakistan E-Commerce Order Analysis (2016-2018)
  • Cleaned E-Commerce Transaction Data for Pakistan
  • Analysis-Ready Pakistan Retail Dataset
  • Pakistan Consumer Purchase History (E-Commerce)

Attributes

Listing Stats

VIEWS

1

DOWNLOADS

1

LISTED

17/09/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in ZIP Format