King County Residential Price Predictor
Data Science and Analytics
Free
About
A record of residential property sales in the King County area, USA, intended to support predictive models for estimating house prices. It includes variables such as property size, sale history, number of rooms, and building quality, alongside external real estate value indices (the Zillow Home Value Index). Real estate agencies can use it to produce market estimates that help their clients make informed selling or purchasing decisions.
Columns
- DocumentDate: The specific date when the property transaction occurred.
- SalePrice: The recorded sale price of the property. Values range from 3,000 to 11.0 million.
- PropertyID: A unique identifier assigned to the individual property.
- PropertyType: Categorises the type of property, predominantly Single Family (91%) and Townhouse (8%).
- ym: Year and month of the sale, formatted as YYYY-MM.
- zhvi_px: The Zillow Home Value Index, representing the average price. The mean value is approximately 392k.
- zhvi_idx: The Zillow Home Value Index, provided in a normalized format.
- AdjSalePrice: The sale price after being adjusted for inflation.
- NbrLivingUnits: The count of residential units found on the property. Most properties (over 22,300) have one unit.
- SqFtLot: The total size of the property lot, measured in square feet.
- SqFtTotLiving: The overall living area of the property, measured in square feet. The mean area is roughly 2.08k square feet.
- SqFtFinBasement: The size of the finished basement area, in square feet.
- Bathrooms: The total number of bathrooms present in the property, with a mean of 2.18.
- Bedrooms: The total number of bedrooms in the property, with a mean of 3.37.
- BldgGrade: An overall quality assessment of the building, graded on a scale of 1 to 13.
- YrBuilt: The calendar year in which the building was originally constructed.
- YrRenovated: The year of the property's last renovation (listed as 0 if never renovated).
- TrafficNoise: An indicator of the level of traffic-related noise (rated 0 to 3).
- LandVal: The estimated monetary value of the land itself.
- ImpsVal: The estimated monetary value of the property improvements (structures and buildings).
- ZipCode: The postal code where the property is located.
- NewConstruction: A boolean field indicating whether the property is a new construction (True/False).
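As a rough illustration of how a few of these columns might be read into typed values, the sketch below uses only the Python standard library. It is not an official loader: the comma delimiter, the ISO `YYYY-MM-DD` date format, and the `True`/`False` spelling of NewConstruction are assumptions about the file, the helper names `parse_sale` and `load_sales` are hypothetical, and the two sample rows are made up rather than taken from the data.

```python
import csv
import io
from datetime import date


def parse_sale(row):
    """Convert one raw CSV row (a dict of strings) into typed values."""
    yr_renovated = int(row["YrRenovated"])
    return {
        # Assumption: dates are stored as YYYY-MM-DD.
        "DocumentDate": date.fromisoformat(row["DocumentDate"]),
        "SalePrice": float(row["SalePrice"]),
        "SqFtTotLiving": float(row["SqFtTotLiving"]),
        "BldgGrade": int(row["BldgGrade"]),
        # 0 means "never renovated", so map it to None instead of year 0.
        "YrRenovated": yr_renovated if yr_renovated != 0 else None,
        # Assumption: booleans are spelled "True" / "False".
        "NewConstruction": row["NewConstruction"].strip().lower() == "true",
    }


def load_sales(handle, delimiter=","):
    """Read every row from an open text handle; the delimiter is an assumption."""
    return [parse_sale(r) for r in csv.DictReader(handle, delimiter=delimiter)]


# Made-up rows standing in for the real file.
sample = io.StringIO(
    "DocumentDate,SalePrice,SqFtTotLiving,BldgGrade,YrRenovated,NewConstruction\n"
    "2014-09-16,280000,2400,7,0,False\n"
    "2006-06-16,1000000,3764,10,1995,True\n"
)
sales = load_sales(sample)
```

For the real file, the same function could be called with `open("house_sales.csv", newline="")`, adjusting `delimiter` if the file turns out not to be comma-separated.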
Distribution
The data consists of 22 variables and contains 22,688 individual sales records. The associated file size is 3.32 MB, usually provided in CSV format (house_sales.csv). The data is static and is not expected to be updated.
Usage
This data is suited to machine learning regression tasks focused on predicting residential prices. It also works for data visualization projects and as tabular practice data for beginners in data science and statistical analysis.
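A minimal sketch of the kind of regression baseline described here: ordinary least squares with a single predictor, fitting price against living area. The choice of SqFtTotLiving as the only feature is an assumption made for brevity, `fit_line` is a hypothetical helper, and the three data points are invented so that the arithmetic is easy to follow. A real model would likely use several columns and a proper library.

```python
def fit_line(xs, ys):
    """Fit y = intercept + slope * x by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance and variance terms (unnormalized; the 1/n factors cancel).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Invented values standing in for SqFtTotLiving and SalePrice.
sqft = [1000.0, 2000.0, 3000.0]
price = [150000.0, 250000.0, 350000.0]

slope, intercept = fit_line(sqft, price)

# Estimated price for a hypothetical 2,500 sq ft home.
predicted = intercept + slope * 2500.0
```

A one-variable fit like this is mainly useful as a reference point against which richer models (for example, ones that add BldgGrade, Bathrooms, or ZipCode) can be compared.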
Coverage
The geographic scope covers residential property sales within the King County area of the United States. The temporal coverage spans sales recorded from January 3, 2006, through to May 29, 2015. The dataset covers different property types, including Single Family homes and Townhouses.
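The stated coverage window can double as a simple sanity check when working with the file. The sketch below tests whether a sale date falls between January 3, 2006 and May 29, 2015 and whether it agrees with the `ym` column's YYYY-MM value. The function name `check_sale_date` is hypothetical, and treating `ym` as derived directly from DocumentDate is an assumption based on the column descriptions.

```python
from datetime import date

# Coverage window as stated in the dataset description.
COVERAGE_START = date(2006, 1, 3)
COVERAGE_END = date(2015, 5, 29)


def check_sale_date(document_date, ym):
    """Return True if the date is inside the coverage window and matches ym."""
    in_range = COVERAGE_START <= document_date <= COVERAGE_END
    # Assumption: ym is the year and month of DocumentDate, as YYYY-MM.
    ym_matches = document_date.strftime("%Y-%m") == ym
    return in_range and ym_matches


# Made-up examples: one consistent record, one outside the window,
# and one whose ym disagrees with its date.
ok = check_sale_date(date(2014, 9, 16), "2014-09")
too_early = check_sale_date(date(2005, 12, 31), "2005-12")
mismatched = check_sale_date(date(2014, 9, 16), "2014-10")
```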
License
CC0: Public Domain
Who Can Use It
The dataset is intended for use by real estate professionals and agencies who need robust tools to estimate property values quickly and accurately. It is also valuable for data scientists, statisticians, and academic researchers focused on econometric modelling, housing market dynamics, and predictive analytics.
Dataset Name Suggestions
- King County Residential Price Predictor
- US Housing Sales Data
- King County Real Estate Values 2006-2015
- Property Price Forecasting Dataset
Attributes
Original Data Source: King County Residential Price Predictor
