Cleaned Used Car Market Data
E-commerce & Online Transactions
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This cleaned version of vehicle sales information provides detailed records of used car transactions. The dataset is designed to offer robust insights into vehicle characteristics, market trends (MMR values), selling prices, and transaction details. It is a highly usable resource for analysing how factors like condition, mileage, and vehicle type influence pricing and sales outcomes. The data has undergone substantial cleaning, resulting in a reliable set of approximately 440,000 usable rows, despite the original data having over 11% null values primarily in the transmission field.
Columns
The dataset includes 16 fields covering vehicle specifics, transaction information, and market context:
- COMPANY: The name of the manufacturer of the car (e.g., Ford, Chevrolet).
- MODEL: The specific model of the vehicle (e.g., Altima, F-150).
- TYPE: The detailed type of the car.
- SIZE: The physical size classification of the car (e.g., Sedan, SUV).
- transmission: Details the car's gear system, indicating whether it is automatic or manual (automatic makes up 85% of records).
- state: The US State where the vehicle was sold (38 unique states are represented, with Florida and California being the most common).
- condition: A superficial numerical value reflecting the car's condition (mean score is 30.6).
- odometer: The mileage the car had run at the time of sale, measured in kilometres (mean reading is 68.7k).
- color: The exterior colour of the car (black and white are the most frequent).
- interior: The colour of the car's interior (black and gray are the most frequent).
- seller: Identification or name of the entity selling the vehicle.
- mmr: The estimated market value of the vehicle, offering a measure of market trends (mean value is 13.7k).
- sellingprice: The final transaction price of the vehicle (mean price is 13.5k).
- sale Day: The day of the week the transaction occurred (Tuesday is the most common sale day).
- Sale month: The month the vehicle was sold (February is the most common sale month).
- Sale year: The year of the sale transaction (primarily 2014 and 2015).
Distribution
The data file is typically provided in a CSV format. It contains 16 columns in total. The initial dataset contained 550,000 total rows; however, after cleaning and removing nulls and garbage values, there are approximately 440,000 usable records available for analysis.
Usage
This data is ideal for several analytical applications:
- Market Trend Analysis: Assessing fluctuations using MMR values.
- Pricing Strategy: Developing predictive models for used car selling prices based on characteristics like mileage and condition.
- Sales Performance: Analysing regional and temporal sales patterns (by state, day, and month).
- Data Cleaning Benchmarks: Using the cleaned version as a reference for pre-processing techniques.
Coverage
The data covers transactions that took place during 2014 and 2015. Geographically, it spans 38 unique states in the US. No specific demographic scope is provided, but it pertains to the sales of various vehicle makes and models.
License
CC0: Public Domain
Who Can Use It
- Data Scientists: For training machine learning models to predict used car prices or market valuation.
- Business Analysts: To understand factors influencing vehicle depreciation and market demand across different states.
- Automotive Researchers: To study market trends, specific vehicle characteristics (like type and size), and sales velocity.
Dataset Name Suggestions
- Cleaned Used Car Market Data
- Automobile Sales and Pricing (2014-2015)
- Pre-owned Vehicle Transaction Metrics
- US Vehicle Sales Data (Cleaned)
Attributes
Original Data Source: Cleaned Used Car Market Data
Loading...
