Opendatabay APP

United States Nationally Notifiable Disease Registry

Patient Health Records & Digital Health

Tags and Keywords

Diseases

Health

History

Surveillance

Epidemiology


Free

About

Tracking the historical trajectory of infectious diseases in the United States gives a foundation for understanding how public health has changed over nearly 100 years. This collection documents weekly reports of eight human diseases that were prominent during the 20th century. Because it provides both raw case counts and population-adjusted incidence rates per 100,000 people, researchers can track the effect of medical interventions and the decline of illnesses such as Polio and Smallpox. It serves as a historical record for studying how demographics and contagious disease dynamics interact.

Columns

  • epi_week: The specific epidemiological week associated with the disease report.
  • state: The identifier for the US state where the data was recorded.
  • loc: The specific location name, which may represent a city or a state.
  • loc_type: A classification categorising the record as either a city-level or state-level report.
  • disease: The name of the specific illness reported, including Diphtheria, Hepatitis A, Measles, Mumps, Pertussis, Polio, Rubella, and Smallpox.
  • cases: The total number of individual disease cases reported in that location for the given week.
  • incidence_per_100000: The calculated incidence rate per 100,000 people, allowing for standardised comparisons across regions with different population sizes.
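As a worked illustration of the incidence_per_100000 column, the sketch below uses the conventional definition of an incidence rate: cases divided by population, scaled to 100,000 people. The listing does not state the exact formula or the population source used, so both the function name and the figures here are assumptions for illustration only.

```python
def incidence_per_100000(cases: int, population: int) -> float:
    """Return the number of cases per 100,000 people (conventional definition)."""
    if population <= 0:
        raise ValueError("population must be positive")
    # Multiply before dividing to keep the arithmetic exact for round inputs.
    return cases * 100_000 / population


# Hypothetical figures: 50 reported cases in a population of 1,000,000.
example_rate = incidence_per_100000(50, 1_000_000)
print(example_rate)  # 5.0
```

Expressing counts this way is what makes a small city and a large state comparable on the same scale.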

Distribution

The information is delivered in a single CSV file titled Project_Tycho_Level_1_Data.csv, with a file size of approximately 30.86 MB. It contains exactly 759,000 valid records across 7 distinct columns, with a 100% validity rate and no missing or mismatched entries. This is a static historical archive, and future updates are not expected.
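A minimal sketch of reading the file with Python's standard csv module and checking that the seven documented columns are present. The sample rows are made up so the snippet runs without the real file, and the value formats shown (for example, epi_week written as a year followed by a week number, and upper-case names) are assumptions rather than facts taken from the listing.

```python
import csv
import io

# Column names as documented in the Columns section.
EXPECTED_COLUMNS = [
    "epi_week", "state", "loc", "loc_type",
    "disease", "cases", "incidence_per_100000",
]


def read_records(handle):
    """Read rows from an open CSV handle after verifying the header."""
    reader = csv.DictReader(handle)
    header = reader.fieldnames or []
    missing = [name for name in EXPECTED_COLUMNS if name not in header]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    records = []
    for row in reader:
        # Convert the two numeric columns; the rest stay as strings.
        row["cases"] = int(row["cases"])
        row["incidence_per_100000"] = float(row["incidence_per_100000"])
        records.append(row)
    return records


# Made-up sample rows, not taken from the real file.
SAMPLE = """epi_week,state,loc,loc_type,disease,cases,incidence_per_100000
196601,NY,NEW YORK,STATE,MEASLES,120,0.66
196602,NY,NEW YORK,STATE,MEASLES,95,0.52
"""

sample_records = read_records(io.StringIO(SAMPLE))
print(len(sample_records), sample_records[0]["cases"])
```

For the real download, the same function could be called as `read_records(open("Project_Tycho_Level_1_Data.csv", newline=""))`, ideally inside a `with` block.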

Usage

This resource is ideal for conducting longitudinal epidemiological studies to observe how disease patterns shifted over a 95-year period. It is well-suited for building visualisations that map the geographic spread of outbreaks or for testing time-series models on historical health data. Additionally, health policy analysts can use the metrics to evaluate the long-term impact of national immunisation programmes on the incidence of formerly common diseases.
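For the longitudinal use described above, a common first step is to roll weekly counts up into annual totals per disease. The sketch below assumes that epi_week is encoded as a four-digit year followed by a two-digit week (for example 195201); the listing does not confirm this encoding, and the rows are invented for illustration.

```python
from collections import defaultdict


def annual_cases(records):
    """Sum weekly case counts into totals keyed by (disease, year)."""
    totals = defaultdict(int)
    for rec in records:
        # Assumption: epi_week looks like YYYYWW, so integer division
        # by 100 drops the week and leaves the year.
        year = int(rec["epi_week"]) // 100
        totals[(rec["disease"], year)] += int(rec["cases"])
    return dict(totals)


# Made-up rows for illustration only.
sample_rows = [
    {"epi_week": "195201", "disease": "POLIO", "cases": 10},
    {"epi_week": "195202", "disease": "POLIO", "cases": 15},
    {"epi_week": "196201", "disease": "POLIO", "cases": 1},
]

polio_by_year = annual_cases(sample_rows)
print(polio_by_year)
```

Because the file mixes city-level and state-level rows, summing them together could count the same cases twice; filtering on loc_type before aggregating is probably the safer choice.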

Coverage

The geographic scope is centred on the United States, with 51 distinct values in the state column (one more than the 50 states, presumably because a jurisdiction such as the District of Columbia is included) and 166 unique city and state locations. Temporally, the records span 1916 to 2011. The data covers eight nationally notifiable diseases, with approximately 79% of the records reported at the state level and 21% at the city level.
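The state and city split quoted above can be re-derived from the loc_type column. The following sketch counts records per category and converts the counts to percentages; the labels "STATE" and "CITY" are assumed values, since the listing does not spell out how the two categories are written in the file.

```python
from collections import Counter


def loc_type_shares(records):
    """Return the percentage of records for each loc_type value."""
    counts = Counter(rec["loc_type"] for rec in records)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {label: 100 * n / total for label, n in counts.items()}


# Made-up rows: four state-level records and one city-level record.
loc_rows = [{"loc_type": "STATE"} for _ in range(4)] + [{"loc_type": "CITY"}]

shares = loc_type_shares(loc_rows)
print(shares)  # {'STATE': 80.0, 'CITY': 20.0}
```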

License

CC0: Public Domain

Who Can Use It

Epidemiologists and medical historians can use these records to study the decline of major diseases in the 20th century and, in the case of Smallpox, their eradication. Data scientists may use the large volume of structured entries to train models for anomaly detection in health reporting. Students and educators can use the data to explore the relationship between urbanisation, geography, and public health outcomes.

Dataset Name Suggestions

  • Century of Contagion: US Disease Surveillance 1916–2011
  • Project Tycho: US Weekly Infectious Disease Historical Archive
  • United States Nationally Notifiable Disease Registry
  • US Historical Disease Metrics: Cases and Incidence Rates
  • Eight Common Human Diseases: 95 Years of US Health Data

Attributes

Listing Stats

VIEWS

1

DOWNLOADS

0

LISTED

28/12/2025

REGION

GLOBAL

UDQS QUALITY (Universal Data Quality Score)

5 / 5

VERSION

1.0
