Global Blood Type Distribution by Country
Public Health & Epidemiology
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
Explores the global geography of human blood types, detailing the percentage distribution of eight primary blood types (O+, A+, B+, AB+, O-, A-, B-, and AB-) across different countries. The information was gathered by scraping data from the Portuguese Wikipedia article titled "Distribuição do tipo sanguíneo por país." It offers a unique look at public health and demographic variation on a country-by-country basis. While the information is interesting for initial analysis and visualisation, such as creating heatmaps, users should be aware that the size of the data is limited and is not recommended for making serious predictive or statistical modelling inferences.
Columns
The data file, named
bloodtypes.csv, contains 11 distinct columns:- Index: A simple numerical identifier for each record.
- Country: The name of the nation for which the data is recorded. Note that these names currently reflect the original Portuguese source text.
- Population: The total population figure associated with that country.
- O+, A+, B+, AB+, O-, A-, B-, AB-: Eight columns detailing the percentage of the population possessing each specific ABO/Rh blood type within that country.
Distribution
The product is delivered as a single CSV file (
bloodtypes.csv), which is approximately 7.11 kB in size. The structure consists of 11 columns and 100 unique country records. The original data gathering process involved utilising Python tools, specifically the Selenium and BeautifulSoup4 libraries for scraping, with the Pandas library used for structuring the resultant file.Usage
The data is ideally suited for academic or personal projects focusing on exploratory data analysis (EDA) and visualisation. Users can:
- Generate visualisations, such as continental or global heatmaps, illustrating the geographical variation of blood types.
- Perform initial comparisons between population sizes and blood type percentages.
- Use it as a practical sample for teaching or learning basic data scraping, cleaning, and manipulation techniques.
- Inform high-level demographic studies regarding genetic spread.
Coverage
Geographic Scope: The dataset covers 100 countries globally.
Demographic Scope: It provides general population percentages for the standard eight blood types. The data does not currently include details on ethnicity-based distribution, although future expansion in that direction is planned.
Time Range: The information reflects data scraped at a fixed point in time. The expected update frequency for this specific product version is noted as "Never."
License
CC0: Public Domain
Who Can Use It
- Public Health Researchers: For initial investigations into global health trends and genetic markers.
- Data Analysts and Scientists: Seeking a small, clean dataset for practising visualisation or statistical correlation techniques.
- Students and Educators: For use in lessons covering demographics, biology, or data science methodology.
Dataset Name Suggestions
- Global Blood Type Distribution by Country
- Human Blood Types Worldwide
- Country Blood Group Percentages
- Wikipedia Scraped Blood Data
Attributes
Original Data Source: Global Blood Type Distribution by Country
Loading...
