Framingham Heart Disease Risk Data
Patient Health Records & Digital Health
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
The data product features patient metrics derived from an ongoing cardiovascular study conducted in Framingham, Massachusetts. This collection is designed to help researchers predict the 10-year risk of developing future coronary heart disease (CHD) based on a variety of health attributes and patient history. It serves as a valuable resource for predictive modelling in the healthcare domain.
Columns
male
: Indicates patient gender (1 for male, 0 for female).age
: The age of the patient in years.education
: The education level attained by the patient (coded 1 to 4).currentSmoker
: Status indicating if the patient is a current smoker (1=Yes, 0=No).cigsPerDay
: The calculated number of cigarettes smoked per day.BPMeds
: Indicates if the patient is currently on blood pressure medication (1=Yes, 0=No).prevalentStroke
: History of a prior stroke (1=Yes, 0=No).prevalentHyp
: History of hypertension (1=Yes, 0=No).diabetes
: Indicates if the patient has diabetes (1=Yes, 0=No).totChol
: The patient's total cholesterol level.sysBP
: The patient's systolic blood pressure.diaBP
: The patient's diastolic blood pressure.BMI
: The patient's Body Mass Index.heartRate
: The patient's heart rate.glucose
: The patient's glucose level.TenYearCHD
: The target variable, indicating the 10-year risk of coronary heart disease (1=Risk, 0=No Risk).
Distribution
The data is structured as a tabular file, provided in CSV format (
framingham_heart_study.csv
), weighing approximately 191.8 kB. It contains 16 attributes and over 4,000 records, specifically 4,240 valid patient records. The expected frequency of updates for this specific dataset is listed as never.Usage
The data is perfectly suited for developing robust predictive models, particularly in health risk assessment. Ideal applications include:
- Building machine learning classification models to forecast 10-year coronary heart disease risk.
- Statistical analysis of cardiovascular risk factors, such as blood pressure and cholesterol levels.
- Epidemiological research focusing on population health and disease prevalence.
- Educational training for predictive analytics and biostatistics.
Coverage
The data originates from the Framingham Heart Study, an ongoing investigation located in Framingham, Massachusetts. The dataset includes demographic information such as gender, age (ranging roughly from 32 to 70 years), and educational attainment. It captures various clinical health metrics and medical history components, including smoking habits, BMI, and diagnoses of diabetes or hypertension.
License
CC0: Public Domain
Who Can Use It
- Data Scientists: To train and evaluate algorithms for binary classification of disease risk.
- Medical Researchers: For validation of existing risk scoring models and identification of new influential health factors.
- Students and Academics: For hands-on projects involving health data analysis and predictive modelling techniques.
Dataset Name Suggestions
- Framingham Heart Disease Risk Data
- Cardiovascular 10-Year Prediction Metrics
- FHS Patient Clinical Predictors
- Heart Study Massachusetts Data
Attributes
Original Data Source: Framingham Heart Disease Risk Data