Poland Used Car Market Data
Stock & Market Data
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset is designed to predict sale prices for cars in Poland. Assembled in January 2022, it contains data publicly sourced from a well-known car sale website in Poland, gathered using Python tools like Selenium and requests. It offers valuable information for training data analysis skills, creating neural networks, and building robust models for car price prediction within the Polish market. The creator, a Python developer, shares this dataset to assist others in their machine learning and data analysis endeavours.
Columns
- Index of listing: This column represents the index of each car listing.
- mark: The make of the car, such as Audi or Opel.
- model: The specific model of the car, for instance, Astra or Seria-3.
- generation_name: Provides the formatted generation name of the car, though this information is not always present. An example would be 'gen-8p-2003-2012'.
- year: The year of production for the car, with data ranging from 1945 to 2022.
- mileage: The car's mileage in kilometres (KM), covering a range from 0 up to 2.8 million KM.
- vol_engine: Indicates the auto engine size, with values spanning from 0 to 7600.
- fuel: Specifies the engine type, primarily Gasoline or Diesel.
- city: The locality in Poland where the car is listed, such as Warszawa or Łódź.
- province: The region of Poland where the car is located, for example, Mazowieckie or Śląskie.
- price: The price of the car in Polish Złoty (PLN), where 1 USD is approximately 1 PLN. Prices range from 500 PLN to 2.4 million PLN.
Distribution
The dataset is typically available as a CSV file (specifically, Car_Prices_Poland_Kaggle.csv). It has a file size of 9.89 MB and is structured as tabular data. The dataset contains approximately 118,000 records across 11 columns.
Usage
This dataset is ideal for:
- Developing and training machine learning models for car price prediction.
- Analysing factors influencing car prices in the Polish market.
- Building neural networks for regression tasks.
- Practising data parsing and analysis techniques.
- Creating notebooks to demonstrate various model applications and insights.
Coverage
- Geographic Scope: The data exclusively covers Poland, including specific cities and regions.
- Time Range: The dataset was assembled in January 2022, with car production years ranging from 1945 to 2022.
- Data Availability: The data was collected from a publicly accessible car sale website in Poland.
License
CC0: Public Domain
Who Can Use It
This dataset is particularly useful for:
- Python developers interested in real-world data projects.
- Data scientists and analysts focusing on predictive modelling and market insights.
- Machine learning engineers looking for a dataset to hone their regression and neural network skills.
- Students and researchers studying automotive markets or data analysis techniques.
Dataset Name Suggestions
- Polish Car Price Prediction
- Poland Used Car Market Data
- Automobile Prices Poland
- Polish Vehicle Sales Data
- Car Valuation Poland Dataset
Attributes
Original Data Source: Poland Used Car Market Data