Opendatabay APP

Classic Boston Real Estate Valuation Data

Commodities & Real Estate

Tags and Keywords

Housing

Boston

Regression

Realestate

Benchmark

Free

About

This dataset contains information collected by the U.S. Census Service concerning housing in the Boston, Massachusetts area. Originally published in 1978 by Harrison, D. and Rubinfeld, D.L., it has been widely used in the literature to benchmark regression algorithms. It relates housing prices to factors such as air quality and socio-economic indicators, which makes it useful for data exploration and for developing predictive models of real estate prices.

Columns

  • CRIM: Per capita crime rate by town.
  • ZN: Proportion of residential land zoned for lots over 25,000 sq.ft.
  • INDUS: Proportion of non-retail business acres per town.
  • CHAS: A dummy variable for the Charles River, indicating if a tract bounds the river (1 if yes, 0 if no).
  • NOX: Concentration of nitric oxides in parts per 10 million.
  • RM: The average number of rooms per dwelling.
  • AGE: The proportion of owner-occupied units that were built before 1940.
  • DIS: Weighted distances to the five main Boston employment centres.
  • RAD: An index measuring accessibility to radial highways.
  • TAX: The full-value property tax rate per $10,000.
  • PTRATIO: The pupil-teacher ratio in the town.
  • B: Calculated as 1000(Bk - 0.63)^2, where Bk is the proportion of black residents by town.
  • LSTAT: The percentage of the population considered to be of a lower socio-economic status.
  • MEDV: The median value of owner-occupied homes, presented in thousands of dollars ($1000s).
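The B column is a derived quantity, not a raw proportion. The following minimal Python sketch applies the formula listed above and shows that the transform cannot always be inverted uniquely. The function names `b_index` and `candidate_proportions` are illustrative and are not part of the dataset or of any library.

```python
import math

def b_index(bk: float) -> float:
    """Apply the listed formula B = 1000 * (Bk - 0.63)^2 to a proportion Bk."""
    return 1000.0 * (bk - 0.63) ** 2

def candidate_proportions(b: float) -> list[float]:
    """Return every Bk in [0, 1] that would produce the given B value."""
    root = math.sqrt(b / 1000.0)
    candidates = {0.63 - root, 0.63 + root}  # a set, so a zero root yields one value
    tol = 1e-9  # small tolerance for floating-point rounding at the interval ends
    return sorted(bk for bk in candidates if -tol <= bk <= 1.0 + tol)
```

For example, `b_index(0.0)` gives approximately 396.9, while `candidate_proportions(10.0)` returns two values (about 0.53 and 0.73), so a B value alone does not always identify the underlying proportion.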

Distribution

The data is structured in a single CSV file, Boston-house-price-data.csv, with a file size of 41.36 kB. It is a small dataset containing 506 cases (rows) and 14 columns, with no missing values. It is not expected to be updated.
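The structure described above can be checked before any modelling. The sketch below uses only the Python standard library. It assumes the CSV has a header row whose names match the column list, which the listing does not confirm, and the helper name `summarize_csv` is illustrative.

```python
import csv
import io

# Column names as given in the listing; their presence in the file header is an assumption.
EXPECTED_COLUMNS = [
    "CRIM", "ZN", "INDUS", "CHAS", "NOX", "RM", "AGE",
    "DIS", "RAD", "TAX", "PTRATIO", "B", "LSTAT", "MEDV",
]

def summarize_csv(handle) -> dict:
    """Count rows, columns, and empty cells in a CSV read from a text handle."""
    reader = csv.reader(handle)
    header = next(reader)  # assumes the first row holds column names
    rows = [row for row in reader if row]  # skip fully blank lines
    missing = sum(1 for row in rows for cell in row if cell.strip() == "")
    return {
        "n_rows": len(rows),
        "n_columns": len(header),
        "n_missing": missing,
        "header_matches": header == EXPECTED_COLUMNS,
    }

# Tiny in-memory stand-in for the real file, with made-up values.
sample = io.StringIO("CRIM,RM,MEDV\n0.1,6.0,20.0\n0.2,,22.5\n")
print(summarize_csv(sample))
```

To run it against the real file, open `Boston-house-price-data.csv` with `open(path, newline="")` and pass the handle to `summarize_csv`. If the description above holds, the result should report 506 rows, 14 columns, and no missing cells. Note that this check only detects empty cells; placeholder strings such as "NA" would need a separate test.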

Usage

This dataset is ideal for benchmarking regression models and exploring factors that influence real estate prices. Key applications include predictive modelling for housing prices, linear regression analysis, and general data exploration tasks.
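As a starting point for the benchmarking use described above, here is a minimal single-feature least-squares sketch in plain Python. It fits MEDV against one predictor such as RM. The choice of RM, the function names, and the toy numbers are illustrative assumptions, not results from the dataset. A fuller benchmark would typically use all 13 predictors and a held-out test set.

```python
import math

def fit_simple_ols(x, y):
    """Fit y = intercept + slope * x by ordinary least squares (one feature)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx  # raises ZeroDivisionError if x is constant
    intercept = mean_y - slope * mean_x
    return intercept, slope

def rmse(x, y, intercept, slope):
    """Root mean squared error of the fitted line on (x, y)."""
    errors = [(yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y)]
    return math.sqrt(sum(errors) / len(errors))

# Toy values standing in for RM (rooms) and MEDV ($1000s); not real data.
rooms = [5.0, 6.0, 7.0, 8.0]
medv = [15.0, 20.0, 25.0, 30.0]
intercept, slope = fit_simple_ols(rooms, medv)
error = rmse(rooms, medv, intercept, slope)
print(f"MEDV = {intercept:.2f} + {slope:.2f} * RM, RMSE = {error:.2f}")
```

RMSE is reported in the same units as MEDV (thousands of dollars), which makes it a convenient figure for comparing this baseline against more complex regression models.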

Coverage

The data covers housing information for various towns within the Boston, Massachusetts area. The information was collected by the U.S. Census Service and originally published in 1978.

License

CC0: Public Domain

Who Can Use It

  • Data Scientists and Analysts: For training and testing regression models.
  • Real Estate Professionals: To understand historical market drivers in the Boston area.
  • Students and Academics: As a classic benchmark dataset for machine learning and statistics courses.
  • Urban Planners: To analyse relationships between housing values and urban characteristics like crime rates and accessibility.

Dataset Name Suggestions

  • Boston Area Housing and Demographics (1978)
  • Classic Boston Real Estate Valuation Data
  • Boston Housing Factors and Median Value
  • Socio-Economic Indicators and Housing Prices in Boston

Attributes

Listing Stats

  • Views: 1
  • Downloads: 0
  • Listed: 17/09/2025
  • Region: Global
  • Universal Data Quality Score (UDQS): 5 / 5
  • Version: 1.0