Opendatabay APP

Used Vehicle Transactions and Attributes

Data Science and Analytics

Tags and Keywords

Cars

Pricing

Mileage

Vehicles

Regression

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Used Vehicle Transactions and Attributes Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

Data detailing 762,091 used vehicles scraped from the cars.com marketplace in April 2023. This resource serves as a powerful foundation for quantitative analysis of the US used car market. It includes crucial parameters for market segmentation, forecasting vehicle values, and understanding key factors influencing used car sales, such as mileage, accident history, and seller performance metrics.

Columns

The dataset features 20 distinct columns, detailing the vehicle characteristics and sale parameters:
  • manufacturer: The name of the vehicle's producer.
  • model: The specific name of the vehicle model.
  • year: The year the vehicle was manufactured, spanning from 1915 to 2024.
  • mileage: The distance the car has travelled since production.
  • engine: Specific details regarding the car engine.
  • transmission: The type of transmission installed (e.g., automatic, manual).
  • drivetrain: The type of drivetrain the vehicle employs (e.g., Front-wheel Drive, All-wheel Drive).
  • fuel_type: The type of fuel the vehicle consumes, with Gasoline being the most common type.
  • mpg: Miles the vehicle can travel per gallon of fuel.
  • exterior_color: The vehicle's exterior finish colour.
  • interior_color: The colour of the car's interior.
  • accidents_or_damage: A binary indicator specifying whether the car has been involved in accidents.
  • one_owner: A binary indicator denoting if the car was owned by only one person.
  • personal_use_only: A binary indicator showing if the car was used strictly for personal purposes.
  • seller_name: The name of the entity selling the vehicle.
  • seller_rating: The rating assigned to the seller (ranging from 1 to 5).
  • driver_rating: The rating given to the car by drivers (ranging from 1 to 5).
  • driver_reviews_num: The number of reviews left by drivers for the vehicle model.
  • price_drop: The amount of price reduction from the initial listing price.
  • price: The current listing price of the vehicle.

Distribution

The data is contained within a single file named cars.csv, weighing approximately 145.22 MB. The structure includes 20 columns and 762,091 records. While most columns are fully populated, certain fields, such as price_drop (46% missing) and seller_rating (28% missing), contain significant missing values.

Usage

This dataset is ideal for regression modelling, particularly for predicting the market price of used cars based on their attributes, condition, and seller quality. It supports analyses relating to vehicle depreciation rates tied to mileage and age, comparison studies of different car manufacturers, and evaluation of the impact of accidents or single ownership on resale value.

Coverage

The data covers the used car market within the United States. The records were scraped in April 2023. The scope of vehicle manufacturing years is extremely broad, spanning from classic cars produced in 1915 up to models from 2024.

License

CC0: Public Domain

Who Can Use It

Data scientists and machine learning engineers developing predictive price models; market researchers seeking insights into vehicle trends and consumer preferences; automotive industry professionals analysing inventory dynamics; and financial analysts evaluating asset depreciation.

Dataset Name Suggestions

  • US Used Vehicle Pricing and Features (2023)
  • Vast Used Car Market Data (cars.com scrape)
  • US Automobile Data Analysis Resource
  • Used Vehicle Transactions and Attributes

Attributes

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

17/10/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Loading...

Free

Download Dataset in CSV Format