Automotive Price and Spec Dataset
Product Reviews & Feedback
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset is designed for analysing car-related information, providing valuable insights into vehicle characteristics and market dynamics. It serves various purposes such as market research, product development, predictive modelling, and risk assessment. The data is particularly useful for academic research in fields like transportation engineering, automotive engineering, data science, and machine learning. Numerous organisations contribute to the creation and distribution of car datasets, making this a relevant resource for studying the automotive industry.
Columns
- S.No./Sno: A serial number or identifier for each record. It contains 7253 valid entries with a mean of 3.63k and ranges from 0 to 7252.
- Name/Namw: Specifies the model name of the car. There are 2041 unique car names, with "Mahindra XUV500 W8 2WD" being the most common. All 7253 entries are valid.
- Location/Loc: Indicates the primary location associated with the vehicle. There are 11 unique locations, with "Mumbai" (13%) and "Hyderabad" (12%) being the most frequent. All 7253 entries are valid.
- Year/Year: Represents the manufacturing year of the car. The data spans from 1996 to 2019, with a mean year of 2014. All 7253 entries are valid.
- Kilometers_Driven/Km: Records the total distance driven by the vehicle. The values range from 171 km to 6.50 million km, with a mean of 58.7k km. All 7253 entries are valid.
- Fuel_Type/FuelType: Describes the type of fuel the car uses. The most common fuel types are Diesel (53%) and Petrol (46%). There are 5 unique fuel types, and all 7253 entries are valid.
- Transmission/Transmission: Indicates the car's transmission type. Manual transmission accounts for 72% of the records, while Automatic accounts for 28%. All 7253 entries are valid.
- Owner_Type/Owner: Specifies the ownership status of the car. "First" owners comprise 82% of the records, and "Second" owners make up 16%. There are 4 unique owner types, and all 7253 entries are valid.
- Mileage/Mileage: Refers to the car's fuel efficiency, typically in kmpl. There are 450 unique mileage values, with "17.0 kmpl" and "18.9 kmpl" being the most frequent (3% each). 7251 entries are valid, with 2 missing.
- Engine/Engine: Represents the engine displacement in CC. "1197 CC" (10%) and "1248 CC" (8%) are the most common engine sizes among 150 unique values. 7207 entries are valid, with 46 missing.
- Power: Indicates the engine's power output in bhp. "74 bhp" (4%) and "98.6 bhp" (2%) are common among 386 unique values. 7207 entries are valid, with 46 missing.
- Seats: Denotes the number of seating positions in the car. The mean number of seats is 5.28, with the majority of vehicles having 5 seats. Values range from 0 to 10 seats. 7200 entries are valid, with 53 missing.
- New_Price: Provides the original new price of the car. This column has a high percentage of missing values (86%), with only 1006 valid entries. "95.13 Lakh" is a frequently occurring value.
- Price: Represents the current selling price of the car. Prices range from 0.44 to 160, with a mean of 9.48. 6019 entries are valid, with 1234 missing (17%).
Distribution
The dataset is provided in CSV format (
used_cars_data.csv
) and has a size of 785.45 kB. It consists of 14 columns and contains 7253 records in total, with various levels of validity across different columns as detailed above.Usage
This dataset is ideal for:
- Conducting market research on car sales and trends.
- Aiding in product development for new vehicle models.
- Building predictive models for car prices or market demand.
- Performing risk assessment in automotive financing or insurance.
- Facilitating academic research in transportation and automotive engineering, data science, and machine learning.
Coverage
The data covers a time range from 1996 to 2019, providing a historical perspective on vehicle characteristics. Geographically, it includes vehicles from various locations across India, with significant representation from Mumbai (13%) and Hyderabad (12%), and a large proportion from other cities. It's noted that the
New_Price
column has a high percentage of missing data (86%), and the Price
column also has a significant number of missing values (17%).License
CC0: Public Domain
Who Can Use It
- Automotive manufacturers: For understanding market demand and competitive analysis.
- Research institutions: For academic studies in automotive and data science fields.
- Government agencies: For policy making related to transportation and vehicle regulations.
- Data providers: For enriching their automotive data offerings.
- Individuals interested in market research, product development, predictive modelling, risk assessment, or academic research related to automobiles.
Dataset Name Suggestions
- Used Car Market Analytics
- Automotive Price and Spec Dataset
- Vehicle Sales Data India
- Car Data Insights
- Automobile Specifications Database
Attributes
Original Data Source: Automotive Price and Spec Dataset