Classic Boston Real Estate Valuation Data
Commodities & Real Estate
Free
About
This dataset contains information collected by the U.S. Census Service concerning housing in the Boston, Massachusetts area. Originally published in 1978 by Harrison, D. and Rubinfeld, D.L., it has been widely used in the literature to benchmark regression algorithms. It explores the relationship between housing prices and factors such as air quality and socio-economic indicators. The dataset is particularly useful for data exploration and for developing predictive models of real estate prices.
Columns
- CRIM: Per capita crime rate by town.
- ZN: Proportion of residential land zoned for lots over 25,000 sq.ft.
- INDUS: Proportion of non-retail business acres per town.
- CHAS: A dummy variable for the Charles River, indicating if a tract bounds the river (1 if yes, 0 if no).
- NOX: Concentration of nitric oxides in parts per 10 million.
- RM: The average number of rooms per dwelling.
- AGE: The proportion of owner-occupied units that were built before 1940.
- DIS: Weighted distances to the five main Boston employment centres.
- RAD: An index measuring accessibility to radial highways.
- TAX: The full-value property tax rate per $10,000.
- PTRATIO: The pupil-teacher ratio in the town.
- B: Calculated as 1000(Bk - 0.63)^2, where Bk is the proportion of black residents by town.
- LSTAT: The percentage of the population considered to be of a lower socio-economic status.
- MEDV: The median value of owner-occupied homes, presented in thousands of dollars ($1000s).
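The B transformation listed above can be made concrete with a short sketch. The function names here are illustrative and not part of the dataset. The inverse function shows that one B value can correspond to two different proportions, so Bk generally cannot be recovered uniquely from the B column.

```python
import math


def b_from_proportion(bk: float) -> float:
    """Apply the documented formula B = 1000 * (Bk - 0.63)^2."""
    return 1000.0 * (bk - 0.63) ** 2


def proportions_from_b(b: float) -> list[float]:
    """Return every proportion in [0, 1] that maps to the given B value.

    The formula is a parabola centred on 0.63, so there can be two answers.
    """
    offset = math.sqrt(b / 1000.0)
    candidates = {0.63 - offset, 0.63 + offset}  # a set drops the duplicate when offset is 0
    return sorted(p for p in candidates if 0.0 <= p <= 1.0)


def medv_to_dollars(medv: float) -> float:
    """MEDV is recorded in $1000s, so multiply to get dollars."""
    return medv * 1000.0
```

For example, `proportions_from_b(100.0)` returns two candidates, roughly 0.31 and 0.95, which is why the column is usually treated as a derived feature rather than a direct proportion.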
Distribution
The data is structured in a single CSV file, Boston-house-price-data.csv, with a file size of 41.36 kB. It is a small dataset containing 506 cases (rows) and 14 columns, with no missing values. It is not expected to be updated.
Usage
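Before using the file, it may be worth confirming that a downloaded copy matches the stated layout of 506 rows, 14 columns, and no missing values. The sketch below uses only the standard library. It assumes the CSV has a header row whose names match the column list in this description, which the listing does not confirm; the helper names are hypothetical.

```python
import csv
import io

# Assumption: the header row uses exactly these names.
EXPECTED_COLUMNS = [
    "CRIM", "ZN", "INDUS", "CHAS", "NOX", "RM", "AGE",
    "DIS", "RAD", "TAX", "PTRATIO", "B", "LSTAT", "MEDV",
]


def summarize_csv(stream):
    """Count data rows, list header names, and count empty cells."""
    reader = csv.DictReader(stream)
    rows = list(reader)
    columns = list(reader.fieldnames or [])
    missing = sum(
        1
        for row in rows
        for name in columns
        if row[name] is None or row[name].strip() == ""
    )
    return {"rows": len(rows), "columns": columns, "missing": missing}


def matches_description(summary):
    """Check the summary against the figures stated in this listing."""
    return (
        summary["rows"] == 506
        and len(summary["columns"]) == 14
        and set(summary["columns"]) == set(EXPECTED_COLUMNS)
        and summary["missing"] == 0
    )


# Tiny in-memory example with made-up values (not taken from the dataset).
demo = io.StringIO("CRIM,MEDV\n0.1,20.0\n0.2,\n")
demo_summary = summarize_csv(demo)  # 2 rows, 2 columns, 1 empty cell

# With the real file, the call would look like this:
# with open("Boston-house-price-data.csv", newline="") as f:
#     print(matches_description(summarize_csv(f)))
```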
This dataset is ideal for benchmarking regression models and exploring factors that influence real estate prices. Key applications include predictive modelling for housing prices, linear regression analysis, and general data exploration tasks.
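As a rough illustration of the linear regression use case, the sketch below fits a one-predictor least-squares line and reports R squared using only the standard library. It is a minimal baseline under stated assumptions, not a recommended modelling pipeline: the function names are hypothetical, and the toy numbers are invented to show the mechanics, not drawn from the dataset. In practice xs might hold a column such as RM and ys the MEDV column.

```python
from statistics import mean


def fit_simple_ols(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


def r_squared(xs, ys, slope, intercept):
    """Share of the variance in ys explained by the fitted line."""
    y_bar = mean(ys)
    ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - y_bar) ** 2 for y in ys)
    return 1.0 - ss_res / ss_tot


# Toy data lying exactly on y = 2x + 1 (invented for illustration).
toy_x = [1.0, 2.0, 3.0, 4.0]
toy_y = [3.0, 5.0, 7.0, 9.0]
toy_slope, toy_intercept = fit_simple_ols(toy_x, toy_y)
toy_r2 = r_squared(toy_x, toy_y, toy_slope, toy_intercept)
```

A single-predictor fit like this is mainly useful as a reference point; models using all 13 predictors can then be compared against it.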
Coverage
The data covers housing information for various towns within the Boston, Massachusetts area. The information was collected by the U.S. Census Service and originally published in 1978.
License
CC0: Public Domain
Who Can Use It
- Data Scientists and Analysts: For training and testing regression models.
- Real Estate Professionals: To understand historical market drivers in the Boston area.
- Students and Academics: As a classic benchmark dataset for machine learning and statistics courses.
- Urban Planners: To analyse relationships between housing values and urban characteristics like crime rates and accessibility.
Dataset Name Suggestions
- Boston Area Housing and Demographics (1978)
- Classic Boston Real Estate Valuation Data
- Boston Housing Factors and Median Value
- Socio-Economic Indicators and Housing Prices in Boston
Attribution
Original Data Source: Classic Boston Real Estate Valuation Data