United States Bee Colony Health Dataset
Data Science and Analytics
Tags and Keywords
Trusted By



"No reviews yet"
Free
About
This dataset is designed to facilitate the analysis of honey bee populations in the United States between 2015 and 2022. It offers critical insights into the alarming decline of international honey bee populations, which are vital pollinators, often attributed to factors such as climate change, parasites, and diseases. The data, originally collected by the USDA, has been curated to enable exploratory data analysis (EDA) and predictive modelling, helping users identify important trends in bee colony health and decline.
Columns
- state: The specific state within the USA. Note: 'Other' represents a collection of states for privacy, and 'United States' signifies the average across all states.
- num_colonies: The total number of honey bee colonies.
- max_colonies: The maximum number of honey bee colonies recorded for a given quarter.
- lost_colonies: The count of colonies that were lost during a specific quarter.
- percent_lost: The percentage of honey bee colonies lost during that quarter.
- renovated_colonies: The number of colonies that underwent 'requeening' or received new bees.
- percent_renovated: The percentage of honey bee colonies that were renovated.
- quarter: The quarter of the year, with Q1 being January to March, Q2 April to June, Q3 July to September, and Q4 October to December.
- year: The specific year the data pertains to, ranging from 2015 to 2022.
- varroa_mites: The percentage of colonies affected by Varroa mites, a species known to impact honey bee populations.
- other_pests_and_parasites: The percentage of colonies affected by a collection of other harmful pests and parasites.
- diseases: The percentage of colonies affected by certain diseases.
- pesticides: The percentage of colonies affected by the use of certain pesticides.
- other: The percentage of colonies affected by unlisted causes.
- unknown: The percentage of colonies affected by an unknown cause.
Distribution
The dataset is provided in a CSV (Comma Separated Values) format, with the file named
save_the_bees.csv
. It has a file size of 112.01 kB and contains 17 distinct columns. The data includes 1453 records across all columns.Usage
This dataset is ideal for:
- Exploratory Data Analysis (EDA) to uncover patterns and anomalies in honey bee population trends.
- Predictive modelling to forecast future population changes or identify risk factors.
- Environmental research focused on biodiversity, ecosystem health, and pollinator decline.
- Agricultural studies examining the impact of pests, diseases, and pesticides on bee colonies.
- Policy development for conservation efforts and agricultural regulations.
Coverage
The data encompasses honey bee populations within the United States, covering the period from 2015 to 2022. The geographic scope includes individual states, with a collective category for 'Other' states (for privacy) and an overall 'United States' average. No specific notes on data availability for certain groups/years beyond the explicit time range are mentioned.
License
CC0: Public Domain
Who Can Use It
- Data scientists and analysts for trend identification, statistical analysis, and model building.
- Environmental researchers and biologists studying pollinator health, ecological impacts, and conservation strategies.
- Government agencies and policymakers involved in agriculture, environmental protection, and public health.
- Students and academics for educational purposes, research projects, and scientific publications related to entomology or environmental science.
- Agricultural industry stakeholders to understand threats to pollination services.
Dataset Name Suggestions
- US Honey Bee Population Analysis 2015-2022
- United States Bee Colony Health Dataset
- North American Honey Bee Decline Data
- USDA Honey Bee Colony Trends
- Pollinator Population Statistics USA
Attributes
Original Data Source: United States Bee Colony Health Dataset