Country-Level Obesity Death Data
Public Health & Epidemiology
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
Data on the projection of deaths linked to obesity and overweight across various countries. This dataset aims to support the estimation of mortality rates attributable to high Body Mass Index (BMI) using historical data, making it ideal for regression problems and predictive modelling in global health. The dataset includes separate files for training, testing, and answering specific projection queries.
Columns
The primary
answer.csv
file includes the following columns:- Country: The name of the country. There are 205 unique country names represented.
- Year: The year for which the data is provided, which is consistently 2014.
- Death-By-High-BMI: Represents the number of deaths from all causes attributed to a high Body Mass Index, expressed per 100,000 people. This metric is age-standardised and covers both sexes.
Other variables mentioned for the broader dataset include country codes, population, percentage of the population defined as overweight (BMI >= 25, age-standardised, both sexes, 18+), mean daily caloric supply among overweight or obese men, mean BMI (for both male and female, 18+ years), percentage of overweight adults (age-standardised, 18+ years, both male and female), prevalence of hypertension among adults aged 30-79 years (age-standardised, both male and female), and prevalence of obesity (BMI >= 30, age-standardised, both sexes, 18+).
Distribution
The dataset is typically provided in a CSV format. The
answer.csv
file contains 205 records, with each record representing a country. This particular file is approximately 5.19 kB. The 'Year' column consistently shows '2014' for all entries. The 'Death-By-High-BMI' column presents values ranging from approximately 133,000 to 3.18 million, with an average of around 968,000 deaths and a standard deviation of about 583,000. All 205 records in answer.csv
are valid and complete for all listed columns.Usage
This data is well-suited for regression analysis to estimate future mortality trends related to obesity. It can be used for forecasting health outcomes, conducting exploratory data analysis on global health patterns, and supporting research into social issues linked to public health. The pre-divided TRAIN, TEST, and ANSWER data facilitate immediate work on regression problems.
Coverage
The dataset covers 205 distinct countries globally. The
answer.csv
specifically provides data for the year 2014. The related training data (train.csv
) covers the period from 1990 to 2013, while test data (test.csv
) provides explanatory variables for 2014. Additionally, "hint" data (hint.csv
) is available for 2015 and beyond, which may be useful for forecasting despite having some missing values. The demographic scope generally includes both sexes and individuals aged 18 and over, with some variables specifically detailing men or an age range of 30-79 years for hypertension prevalence.License
CC BY-SA 4.0
Who Can Use It
- Data Scientists: For developing and testing regression models to predict health outcomes and analyse health data.
- Public Health Researchers: To analyse the impact of obesity on mortality rates across different nations and understand global health trends.
- Policy Makers: To inform strategies for public health interventions, resource allocation, and address social issues related to health.
- Students: For educational projects focusing on data analytics, global health, and social issues.
- Forecasters: Individuals interested in recent data for future health trend predictions, particularly utilising the additional hint data.
Dataset Name Suggestions
- Global Obesity Mortality Projections
- Country-Level Obesity Death Data
- BMI Mortality Forecasts
- Health Burden of Obesity
- National Obesity-Related Deaths
Attributes
Original Data Source: Country-Level Obesity Death Data