Opendatabay APP

Life Expectancy & Socio-Economic Determinants

Data Science and Analytics

Tags and Keywords

Health

Life

Expectancy

Countries

Economy

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Life Expectancy & Socio-Economic Determinants Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset provides adjusted and updated information on global health, life expectancy, immunisation, and various economic and demographic indicators for 179 countries, spanning the years 2000 to 2015. Initially sourced from Kaggle, the data has been thoroughly updated and missing values addressed using robust strategies. These strategies include filling data with the closest three-year average if a specific country had a missing value in any year, or with the average of its region (e.g., Africa, Asia, European Union) if values were missing across all years for a country. Countries with more than four missing data columns, such as Sudan, South Sudan, and North Korea, were omitted to maintain data quality. The dataset incorporates a classification for countries based on their Gross National Income per capita, categorising them into high-income, higher-middle-income, lower-middle-income, and low-income groups, aligning with World Bank standards for comparability. Information on population, GDP, and life expectancy has been updated using World Bank data, while vaccination statistics (Measles, Hepatitis B, Polio, Diphtheria), alcohol consumption, BMI, HIV incidents, mortality rates, and thinness figures were collected from World Health Organisation public datasets. Schooling information was gathered from Our World in Data, a University of Oxford project.

Columns

  • Country: A list of 179 distinct countries included in the dataset.
  • Region: Categorises the 179 countries into 9 geographical regions, such as Africa, Asia, Oceania, and the European Union.
  • Year: The observed year, ranging from 2000 to 2015.
  • Infant_deaths: Represents the number of infant deaths per 1,000 population.
  • Under_five_deaths: Represents the number of deaths of children under five years old per 1,000 population.
  • Adult_mortality: Represents the number of deaths of adults per 1,000 population.
  • Alcohol_consumption: Records alcohol consumption in litres of pure alcohol per capita for individuals aged 15 years and over.
  • Hepatitis_B: Represents the percentage of coverage for Hepatitis B (HepB3) immunisation among 1-year-olds.
  • Measles: Represents the percentage of coverage for Measles containing vaccine first dose (MCV1) immunisation among 1-year-olds.
  • BMI: Body Mass Index, a measure of nutritional status in adults (defined as a person's weight in kilograms divided by the square of that person's height in meters).
  • Polio: Represents the percentage of coverage for Polio (Pol3) immunisation among 1-year-olds.
  • Diphtheria: Represents the percentage of coverage for Diphtheria tetanus toxoid and pertussis (DTP3) immunisation among 1-year-olds.
  • Incidents_HIV: Represents the incidents of HIV per 1,000 population aged 15-49.
  • GDP_per_capita: Gross Domestic Product per capita in current US Dollars.
  • Population_mln: Total population of a country in millions.
  • Thinness_ten_nineteen_years: Prevalence of thinness among adolescents aged 10-19 years (specifically, BMI < -2 standard deviations below the median).
  • Thinness_five_nine_years: Prevalence of thinness among children aged 5-9 years (specifically, BMI < -2 standard deviations below the median).
  • Schooling: Average years individuals aged 25 and over have spent in formal education.
  • Economy_status_Developed: A binary indicator (0 or 1) denoting whether a country is classified as 'Developed'.
  • Economy_status_Developing: A binary indicator (0 or 1) denoting whether a country is classified as 'Developing'.
  • Life_expectancy: The average life expectancy for both genders across different years, from 2000 to 2015.

Distribution

The dataset is provided as a CSV file, named "Life-Expectancy-Data-Updated.csv", with a file size of 307.57 kB. It contains 21 variables and 2,864 rows of adjusted data.

Usage

This dataset is ideal for a wide range of analytical applications, including:
  • Analysing global health trends and their determinants.
  • Researching the impact of economic and social factors on life expectancy.
  • Developing predictive models for public health outcomes.
  • Informing policy decisions related to health, education, and economic development.
  • Studying demographic shifts and their correlations with health indicators.

Coverage

The dataset covers 179 countries globally, distributed across 9 distinct regions. The time scope of the data is from the year 2000 to 2015. Demographic coverage varies by indicator, including specific age groups for mortality, immunisation, alcohol consumption, HIV incidence, BMI, and thinness prevalence. Data availability notes indicate that missing values have been filled using sophisticated imputation strategies, and countries with significant data gaps were excluded from the database. Countries are also categorised by their economic status based on Gross National Income per capita.

License

CC0: Public Domain

Who Can Use It

This dataset is suitable for:
  • Public Health Researchers: To study disease prevalence, mortality rates, and the effectiveness of health interventions globally.
  • Economists and Policy Analysts: To examine the socio-economic factors influencing public health and to guide policy formulation.
  • Data Scientists and Machine Learning Engineers: For building and training models related to health predictions, demographic analysis, and trend forecasting.
  • Academics and Students: For educational purposes, research projects, and gaining insights into global health and development.
  • International Organisations: For monitoring global health indicators and assessing progress on development goals.

Dataset Name Suggestions

  • Global Health & Life Expectancy: 2000-2015
  • World Health Indicators & Demographic Data
  • Country-level Health & Development Statistics (2000-2015)
  • Life Expectancy & Socio-Economic Determinants
  • Adjusted Global Health Dataset

Attributes

Listing Stats

VIEWS

3

DOWNLOADS

1

LISTED

24/07/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in CSV Format