Opendatabay APP

Multi-Source Heart Condition Database

Patient Health Records & Digital Health

Tags and Keywords

Heart

Cardiology

Disease

Prediction

Clinical

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Multi-Source Heart Condition Database Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

Curated combination of five widely used heart disease datasets, previously available independently on the UCI Machine Learning Repository. Merged based on 11 common clinical features, this resource creates a large, unified collection of records for research. By consolidating diverse sources including Cleveland, Hungarian, Switzerland, Long Beach VA, and Statlog (Heart), it enables broader generalisation, improved pattern detection, and supports robust training for machine learning models aiming to assist in early diagnosis and clinical decision-making in cardiology.

Columns

  • age: Patient age in years (Range: 29 to 77).
  • sex: Gender classification (Values: 0, 1).
  • cp: Chest pain type classification (Values: 0, 1, 2, 3).
  • trestbps: Resting blood pressure (Range: 94 to 200).
  • chol: Serum cholesterol in mg/dl (Range: 126 to 564).
  • fbs: Fasting blood sugar > 120 mg/dl (Values: 0, 1).
  • restecg: Resting electrocardiographic results (Values: 0, 1, 2).
  • thalach: Maximum heart rate achieved (Range: 71 to 202).
  • exang: Exercise induced angina (Values: 0, 1).
  • oldpeak: ST depression induced by exercise relative to rest (Range: 0 to 6.2).
  • slope: Slope of the peak exercise ST segment (Values: 0, 1, 2).
  • ca: Number of major vessels coloured by fluoroscopy (Values: 0 to 4).
  • thal: Thalassemia indicator (Values: 0 to 3).
  • target: Diagnosis of heart disease/presence of coronary artery disease (0 = Absence, 1 = Presence).

Distribution

  • Format: CSV (cardiac arrest dataset.csv).
  • Size: 38.11 kB.
  • Rows: 1,025 valid patient instances.
  • Columns: 14 clinical and diagnostic attributes.

Usage

  • Training and evaluating machine learning models for heart disease prediction.
  • Developing clinical decision support systems for early diagnosis.
  • Analysing trends in cardiac health across different demographics.
  • Educational use for data science and medical statistics.

Coverage

  • Geographic Scope: Aggregated from five international sources: Cleveland, Hungary, Switzerland, Long Beach VA, and Statlog.
  • Demographic Range: Patients aged between 29 and 77 years.
  • Data Availability: Unified dataset containing 11 common clinical features merged from the original independent sources.

License

CC0: Public Domain

Who Can Use It

  • Data Scientists and Machine Learning Engineers.
  • Medical Researchers and Cardiologists.
  • Public Health Analysts.
  • Students and Educators in Health Informatics.

Dataset Name Suggestions

  • Unified Heart Disease Indicators
  • Consolidated Cardiac Arrest Factors
  • Global Heart Health Records
  • Merged Clinical Cardiac Dataset
  • Multi-Source Heart Condition Database

Attributes

Listing Stats

VIEWS

1

DOWNLOADS

1

LISTED

06/12/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Loading...

Free

Download Dataset in CSV Format