Opendatabay APP

Galtons Regression Toward the Mean Dataset

Cultural & Historical Records

Tags and Keywords

Height

Heredity

Regression

Biometrics

Galton

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Galtons Regression Toward the Mean Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

Investigating the hereditary patterns of human stature provides a fundamental look into the biological connections between generations. Based on the landmark 1885 study by Francis Galton, these records explore the relationship between the heights of adult children and their parents. This archive is a cornerstone of modern statistics, having introduced the concept of regression toward the mean, and remains a vital tool for understanding how physical traits are passed down within families.

Columns

  • Father_height: The height of the father, measured in inches.
  • Mother_height: The height of the mother, measured in inches.
  • Child_height: The height of the adult child, measured in inches.
  • gender: The biological gender of the child, encoded as male (1) or female (0).

Distribution

The information is delivered in a CSV file titled Galton_Family_Heights.csv with a file size of approximately 26.21 kB. It consists of 934 valid records structured across 4 distinct columns. The data maintains 100% integrity with no missing or mismatched entries reported. As this represents a historical study, the update frequency is set to never.

Usage

This resource is ideal for practicing multiple linear regression and exploring the statistical relationship between hereditary variables. It is well-suited for beginners and intermediate learners to develop predictive models for child height based on parental data. Additionally, the inclusion of gender markers allows for an analysis of sexual dimorphism and its impact on height distribution within family units.

Coverage

The scope of the data is derived from Francis Galton's original biometric research conducted in 1885. The demographic focus is specifically on families, documenting the heights of adult children alongside those of their parents. It provides a historical snapshot of late 19th-century anthropometric data, serving as a classic example of early statistical inquiry.

License

CC0: Public Domain

Who Can Use It

Statistics students and educators can leverage these records to teach the foundational principles of regression and correlation. Data science practitioners may utilise the dataset to refine their skills in tabular data analysis and multiple linear modeling. Furthermore, researchers in the fields of biology and the history of science can find this a valuable primary source for studying the origins of biometric statistics.

Dataset Name Suggestions

  • Galton's 1885 Family Height and Heredity Study
  • Multiple Linear Regression Practice: Parental Height
  • Historical Biometric Heights and Gender Archive
  • Galton's Regression Toward the Mean Dataset
  • Intergenerational Stature and Heredity Registry

Attributes

Listing Stats

VIEWS

3

DOWNLOADS

1

LISTED

31/12/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Loading...

Free

Download Dataset in CSV Format