Opendatabay APP

Liver Disease Prediction Dataset

Patient Health Records & Digital Health

Tags and Keywords

Liver

Health

Prediction

Disease

Synthetic

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Liver Disease Prediction Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset illuminates factors that influence liver health, offering an opportunity to unveil patterns and predictors in liver well-being. Its primary purpose is to support predictive modelling and the development of healthcare strategies. It is particularly suitable for educational purposes in data science and machine learning projects, enabling users to focus on critical analysis aspects.

Columns

  • Age: Represents the age of individuals, ranging from 20 to 80 years.
  • Gender: A binary indicator where 0 denotes Male and 1 denotes Female.
  • BMI (Body Mass Index): Indicates body mass, with values ranging from 15 to 40.
  • Alcohol Consumption: Measures weekly alcohol intake, from 0 to 20 units per week.
  • Smoking: A binary indicator where 0 signifies No smoking and 1 signifies Yes.
  • Genetic Risk: Categorises genetic predisposition as Low (0), Medium (1), or High (2).
  • Physical Activity: Records weekly physical activity in hours, ranging from 0 to 10.
  • Diabetes: A binary indicator where 0 means No diabetes and 1 means Yes.
  • Hypertension: A binary indicator where 0 means No hypertension and 1 means Yes.
  • Liver Function Test: Provides a score for liver function, ranging from 20 to 100.
  • Diagnosis: A binary outcome (0 or 1) indicating the presence or absence of liver disease.

Distribution

This dataset is provided in a CSV format, typical for tabular data, and is named 'Liver_disease_data.csv'. It contains 1700 records and comprises 11 distinct columns. The file size is 150.24 kB. Its structure includes features that span demographic, lifestyle, and various health indicators, making it well-suited for binary classification tasks.

Usage

This dataset is ideal for a range of applications, including:
  • Research Insights: Exploring correlations between lifestyle/health factors and liver disease, and developing predictive models.
  • Healthcare Interventions: Informing preventive measures and personalising healthcare strategies based on identified risk factors.
  • Data Science and Machine Learning Projects: A suitable resource for developing and fine-tuning predictive models related to liver health.

Coverage

The dataset focuses on demographic indicators such as age (20-80 years) and gender (Male/Female). It also covers various lifestyle and health indicators relevant to liver health. This is a synthetic dataset generated for educational purposes. It does not include geographic or specific time range information, and it's noted that additional external factors impacting liver health are not incorporated.

License

the Attribution 4.0 International (CC BY 4.0)

Who Can Use It

This dataset is intended for:
  • Researchers: Those seeking to explore correlations and build predictive models for liver disease.
  • Healthcare Professionals: Individuals interested in informing preventive measures and personalising healthcare.
  • Data Scientists and Machine Learning Practitioners: Anyone looking for a dataset to develop and fine-tune machine learning models, especially for binary classification problems.

Dataset Name Suggestions

  • Liver Disease Prediction Dataset
  • Synthetic Liver Health Predictor
  • Healthcare Risk Factors Dataset
  • Predictive Liver Health Indicators
  • Machine Learning Liver Disease Data

Attributes

Original Data Source: Liver Disease Prediction Dataset

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

31/08/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in CSV Format