Opendatabay APP

Heart Failure Mortality Prediction Dataset

Public Health & Epidemiology

Tags and Keywords

Health

Heart

Failure

Mortality

Prediction

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Heart Failure Mortality Prediction Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset focuses on predicting mortality in patients with heart failure, a critical application given that cardiovascular diseases (CVDs) are the leading global cause of death. It contains 12 clinical features that are instrumental in developing machine learning models to identify patients at high risk of death due to heart failure. The data can aid in early detection and management strategies, potentially preventing a significant number of yearly fatalities associated with CVDs.

Columns

  • age: Age of the patient.
  • anaemia: A boolean indicator for the presence of anaemia (decrease of red blood cells or haemoglobin).
  • creatinine_phosphokinase: Level of the CPK enzyme in the blood, measured in mcg/L.
  • diabetes: A boolean indicator for whether the patient has diabetes.
  • ejection_fraction: Percentage of blood leaving the heart at each contraction.
  • high_blood_pressure: A boolean indicator for the presence of hypertension (high blood pressure).
  • platelets: Platelets in the blood, measured in kiloplatelets/mL.
  • serum_creatinine: Level of serum creatinine in the blood, measured in mg/dL.
  • serum_sodium: Level of serum sodium in the blood, measured in mEq/L.
  • sex: A binary indicator for gender (woman or man).
  • smoking: A boolean indicator for whether the patient smokes.
  • time: The follow-up period for the patient, measured in days.
  • DEATH_EVENT: The target variable, a boolean indicator for whether the patient deceased during the follow-up period.

Distribution

The dataset is provided as a CSV file named heart_failure_clinical_records_dataset.csv. It has a file size of 12.24 kB and contains 299 records (rows). All 13 columns are fully populated, with no missing values.

Usage

This dataset is ideal for building and evaluating machine learning models aimed at predicting mortality caused by heart failure. It can be used by researchers and data scientists to:
  • Develop predictive analytics for patient risk stratification.
  • Explore the significance of various clinical features in heart failure outcomes.
  • Create tools for early detection and management of cardiovascular risks.
  • Contribute to healthcare informatics by advancing AI applications in cardiology.

Coverage

The dataset includes clinical records with features such as age and sex (binary for woman or man), providing some demographic scope. The time column specifies a follow-up period in days, indicating the duration of observation for each patient up to 285 days. The dataset does not explicitly state geographic coverage or specific data collection dates.

License

CC BY 4.0

Who Can Use It

This dataset is primarily intended for data scientists, machine learning engineers, and researchers in the fields of healthcare, medical informatics, and public health. It is particularly valuable for those looking to:
  • Build and benchmark mortality prediction models.
  • Analyse clinical features associated with heart failure.
  • Innovate in preventative healthcare using data-driven insights.

Dataset Name Suggestions

  • Heart Failure Mortality Prediction Dataset
  • Cardiovascular Death Prediction Data
  • Clinical Heart Failure Outcomes
  • Patient Heart Failure Survival Records
  • CVD Mortality Risk Dataset

Attributes

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

08/07/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free