Childhood Allergy Insights Dataset
Public Health & Epidemiology
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset is designed to help understand the prevalence and treatment outcomes of childhood allergies over an extended period. It provides retrospective data, as reported by healthcare providers, on individuals with asthma, atopic dermatitis, allergic rhinitis, and various food allergies. The dataset also includes columns that enable insights into how these outcomes differ across various demographic factors like gender, race, and ethnicity. By examining this data, patterns and trends among diagnosed cases can be recognised, potentially leading to new treatments and prevention strategies for severe allergic reactions in children globally.
Columns
The dataset contains 50 columns in total, including:
- SUBJECT_ID: A unique identifier for each patient. (Integer). Example: Valid 333,000 records.
- BIRTH_YEAR: Year of birth of the patient. (Integer). Range: 1983 to 2012. The most common years are 2000-2005, with high counts (e.g., 17,600 for 2003.88 - 2004.46).
- GENDER_FACTOR: Gender of the patient. (String). S0 - Male (51%), S1 - Female (49%).
- RACE_FACTOR: Race of the patient. (String). R0 - White (55%), R1 - Black (29%), Other (16%).
- ETHNICITY_FACTOR: Ethnicity of the patient. (String). E0 - Non-Hispanic (95%), E1 - Hispanic (5%).
- PAYER_FACTOR: Insurance coverage of the patient. (String). P0 - Non-Medicaid (74%), P1 - Medicaid (26%).
- ATOPIC_MARCH_COHORT: Cohort of the patient. (String). False (91%), True (9%).
- AGE_START_YEARS: Age of the patient at the start of the study. (Integer). Range: -4.31 to 18 years. The most common age range at the start is -0.30 to 0.15 years (125,480 records).
- AGE_END_YEARS: Age of the patient at the end of the study. (Integer). Range: 1 to 19 years. The most common age range at the end is 18.64 to 19.00 years (15,961 records).
- SHELLFISH_ALG_START: Shellfish allergy status at the start of the study. (String). NA (98%).
- SHELLFISH_ALG_END: Shellfish allergy status at the end of the study. (String). NA (100%).
- FISH_ALG_START: Fish allergy status at the start of the study. (String). NA (99%).
- FISH_ALG_END: Fish allergy status at the end of the study. (String). NA (100%).
- MILK_ALG_START: Milk allergy status at the start of the study. (String). NA (98%).
- MILK_ALG_END: Milk allergy status at the end of the study. (String). NA (99%).
- SOY_ALG_START: Soy allergy status at the start of the study. (String). NA (99%).
- SOY_ALG_END: Soy allergy status at the end of the study. (String). NA (100%).
- EGG_ALG_START: Egg allergy status at the start of the study. (String). NA (98%).
- EGG_ALG_END: Egg allergy status at the end of the study. (String). NA (99%).
- WHEAT_ALG_START: Wheat allergy status at the start of the study. (String). NA (100%).
- WHEAT_ALG_END: Wheat allergy status at the end of the study. (String). NA (100%).
- PEANUT_ALG_START: Peanut allergy status at the start of the study. (String). NA (97%).
- PEANUT_ALG_END: Peanut allergy status at the end of the study. (String). NA (99%).
- SESAME_ALG_START: Sesame allergy status at the start of the study. (String). NA (100%).
- SESAME_ALG_END: Sesame allergy status at the end of the study. (String). NA (100%).
- TREENUT_ALG_START: Tree nut allergy status at the start of the study. (String). NA (100%).
- TREENUT_ALG_END: Tree nut allergy status at the end of the study. (String). NA (100%).
- WALNUT_ALG_START: Walnut allergy status at the start of the study. (String). NA (100%).
- WALNUT_ALG_END: Walnut allergy status at the end of the study. (String). NA (100%).
- PECAN_ALG_START: Pecan allergy status at the start of the study. (String). NA (100%).
- PECAN_ALG_END: Pecan allergy status at the end of the study. (String). NA (100%).
- PISTACH_ALG_START: Pistachio allergy status at the start of the study. (String). NA (100%).
- PISTACH_ALG_END: Pistachio allergy status at the end of the study. (String). NA (100%).
- ALMOND_ALG_START: Almond allergy status at the start of the study. (String). NA (100%).
- ALMOND_ALG_END: Almond allergy status at the end of the study. (String). NA (100%).
- BRAZIL_ALG_START: Brazil nut allergy status at the start of the study. (String). NA (100%).
- BRAZIL_ALG_END: Brazil nut allergy status at the end of the study. (String). NA (100%).
- HAZELNUT_ALG_START: Hazelnut allergy status at the start of the study. (String). NA (100%).
- HAZELNUT_ALG_END: Hazelnut allergy status at the end of the study. (String). NA (100%).
- CASHEW_ALG_START: Cashew allergy status at the start of the study. (String). NA (100%).
- CASHEW_ALG_END: Cashew allergy status at the end of the study. (String). NA (100%).
- ATOPIC_DERM_START: Atopic dermatitis status at the start of the study. (String). NA (85%).
- ATOPIC_DERM_END: Atopic dermatitis status at the end of the study. (String). NA (87%).
- ALLERGIC_RHINITIS_START: Allergic rhinitis status at the start of the study. (String). NA (83%).
- ALLERGIC_RHINITIS_END: Allergic rhinitis status at the end of the study. (String). NA (92%).
- ASTHMA_START: Asthma status at the start of the study. (String). NA (81%).
- ASTHMA_END: Asthma status at the end of the study. (String). NA (92%).
- FIRST_ASTHMARX: First asthma medication prescribed. (String). NA (65%).
- LAST_ASTHMARX: Last asthma medication prescribed. (String). NA (65%).
- NUM_ASTHMARX: Number of asthma medications prescribed. (Integer). NA (65%), 1 (11%).
Distribution
The dataset is provided in a CSV file format, specifically named
food-allergy-analysis-Zenodo.csv
. It has a file size of 87.74 MB and consists of 50 columns. The dataset contains a total of 333,000 records.Usage
This dataset is ideal for investigating questions related to childhood allergies. Potential applications and use cases include:
- Identifying risk factors or patterns in childhood allergies to inform preventative and treatment measures.
- Investigating correlations between demographic characteristics (such as age and gender) and the diagnosis or severity of childhood allergies using statistical methods like cross-tabulations.
- Analysing longitudinal trends in treatment outcomes for various types of childhood allergy, including asthma, atopic dermatitis, and food allergy, by comparing patient results over time (e.g., pre-treatment vs. post-treatment diagnoses).
- Conducting descriptive analysis of allergy prevalence or seeking correlations between different conditions.
Coverage
The dataset covers patients born between 1983 and 2012. The patient ages within the study range from -4.31 years at the start to 19 years at the end of the study period.
Demographic scope includes:
- Gender: Male and Female.
- Race: White, Black, and other categories.
- Ethnicity: Non-Hispanic and Hispanic.
- Insurance Coverage: Non-Medicaid and Medicaid.
The data consists of retrospective reports from healthcare providers, which may have potential sources of bias such as difficulties in disease identification (e.g., misdiagnosis) or unreported cases due to lack of healthcare access or awareness. To help reduce bias, it is suggested to use the largest possible datasets.
License
CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
Who Can Use It
This dataset is suitable for:
- Public health researchers and epidemiologists: To study the prevalence, demographic patterns, and trends of childhood allergies.
- Medical professionals and healthcare analysts: To understand treatment outcomes and identify areas for improved patient care strategies.
- Data scientists and statisticians: For conducting in-depth analyses, building predictive models, and exploring correlations between various factors and allergy outcomes.
- Organisations developing new treatments or prevention strategies: To identify risk factors and areas of unmet need in allergy management.
Dataset Name Suggestions
- Childhood Allergy Insights Dataset
- Paediatric Allergy Trends Data
- Allergy Prevalence and Treatment Outcomes for Children
- Demographic Study of Childhood Allergies
Attributes
Original Data Source: Childhood Allergy Insights Dataset`