United States Nationally Notifiable Disease Registry
Patient Health Records & Digital Health
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
Monitoring the historical trajectory of infectious diseases within the United States provides a foundational understanding of public health evolution over nearly 100 years. This collection documents weekly reports of eight significant human diseases that were prominent throughout the 20th century. By providing both raw counts and population-adjusted percentages, the data enables researchers to track the effectiveness of medical interventions and the decline of illnesses such as Polio and Smallpox. It serves as a vital historical record for studying the intersection of demographics and contagious disease dynamics.
Columns
- epi_week: The specific epidemiological week associated with the disease report.
- state: The identifier for the US state where the data was recorded.
- loc: The specific location name, which may represent a city or a state.
- loc_type: A classification categorising the record as either a city-level or state-level report.
- disease: The name of the specific illness reported, including Diphtheria, Hepatitis A, Measles, Mumps, Pertussis, Polio, Rubella, and Smallpox.
- cases: The total number of individual disease cases reported in that location for the given week.
- incidence_per_100000: The calculated incidence rate per 100,000 people, allowing for standardised comparisons across regions with different population sizes.
Distribution
The information is delivered in a single CSV file titled
Project_Tycho_Level_1_Data.csv, with a file size of approximately 30.86 MB. It contains exactly 759,000 valid records across 7 distinct columns, with a 100% validity rate and no missing or mismatched entries. This is a static historical archive, and future updates are not expected.Usage
This resource is ideal for conducting longitudinal epidemiological studies to observe how disease patterns shifted over a 95-year period. It is well-suited for building visualisations that map the geographic spread of outbreaks or for testing time-series models on historical health data. Additionally, health policy analysts can use the metrics to evaluate the long-term impact of national immunisation programmes on the incidence of formerly common diseases.
Coverage
The geographic scope is centred on the United States, including 51 states and 166 unique city and state locations. Temporally, the records provide a wide-reaching view from 1916 through to 2011. The data covers eight specific nationally notifiable diseases, with approximately 79% of the records reported at the state level and 21% at the city level.
License
CC0: Public Domain
Who Can Use It
Epidemiologists and medical historians can leverage these records to study the eradication of major diseases in the 20th century. Data scientists may utilise the large volume of structured entries to train models for anomaly detection in health reporting. Furthermore, students and educators can use the high-integrity data to explore the relationship between urbanisation, geography, and public health outcomes.
Dataset Name Suggestions
- Century of Contagion: US Disease Surveillance 1916–2011
- Project Tycho: US Weekly Infectious Disease Historical Archive
- United States Nationally Notifiable Disease Registry
- US Historical Disease Metrics: Cases and Incidence Rates
- Eight Common Human Diseases: 95 Years of US Health Data
Attributes
Original Data Source: United States Nationally Notifiable Disease Registry
Loading...
Free
Download Dataset in CSV Format
Recommended Datasets
Loading recommendations...
