PakWheels Vehicle Listings Data
Product Reviews & Feedback
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
Explaining data is a large collection of over 75,000 used car specifications and pricing details sourced directly from PakWheels, which operates as the largest online vehicle marketplace in Pakistan. The data was obtained via web scraping using the Python library BeautifulSoup. Given Pakistan's status as the world's fifth-largest country by population, the increasing demand for goods like cars makes this resource a valuable snapshot of the national used automotive market dynamics and characteristics.
Columns
The dataset contains details across 10 distinct columns:
- Make: The company that manufactured the vehicle. Toyota (39%) and Honda (23%) are the most frequently listed manufacturers.
- Name: The specific model and version of the vehicle. Examples include Corolla GLi 1.3 VVTi.
- Price: The sale price of the vehicle, which ranges from 500k up to 93.5 million. The average price is approximately 3.02 million.
- Year: The manufacturing year of the vehicle, spanning from 1940 to 2021.
- Engine Capacity(CC): The displacement of the engine in cubic centimeters (CC), with the mean capacity being approximately 1,500 CC.
- Engine Type: The type of fuel used by the car. Petrol is the dominant fuel type, making up 90% of the listings.
- Transmission: Indicates whether the vehicle is Automatic (58%) or Manual (42%).
- Mileage(kms): The total distance the car has been driven, reported in kilometers. The average mileage stands at about 88,000 km.
- City: The city in Pakistan where the car is being sold or where the dealer is located.
Distribution
The data is delivered in a CSV file format named PakWheelsDataSet.csv, sized at 6.79 MB. It contains 76,700 valid records. The data structure is robust; most critical fields show a 100% validity rate, indicating a negligible amount of missing information across the core attributes.
Usage
This data product is suited for a variety of analytical and business applications:
- Price Prediction Modelling: The structure supports the use of statistical techniques, such as linear regression, to develop accurate models for predicting used car prices based on specifications and location.
- Market Analysis: Researchers can analyze trends related to popular makes, engine types, and regional price variations within the Pakistani automotive sector.
- Business Intelligence: Car sellers and industry stakeholders can use the data to inform inventory choices and optimize pricing strategies based on current market listings.
Coverage
The geographic focus is the used car market within Pakistan. Listings are heavily concentrated in major metropolitan areas, particularly Lahore (21% of listings) and Karachi (20%). The manufacturing dates of the vehicles span from 1940 to 2021. The inventory is relatively modern, with the median manufacturing year being 2014, and three-quarters of the listings being from 2018 onwards.
License
CC0: Public Domain
Who Can Use It
The resource is valuable for users ranging from beginners to intermediate analysts.
- Machine Learning Practitioners: Ideal for building predictive models focused on transport economics.
- Students and Academics: Useful for research concerning urban mobility, consumer demand, and developing economies.
- Automotive Professionals: Those involved in sales, valuation, or dealership management who require market intelligence.
Dataset Name Suggestions
- Pakistan Used Car Specifications and Pricing
- PakWheels Vehicle Listings Data
- Automotive Market Dynamics Pakistan
Attributes
Original Data Source: PakWheels Vehicle Listings Data
Loading...
