Opendatabay APP

California COVID-19 Variant Surveillance

Patient Health Records & Digital Health

Tags and Keywords

Covid-19

California

Variants

Health

Genomic

Trusted By
Trusted by company1Trusted by company2Trusted by company3
California COVID-19 Variant Surveillance Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

The prevalence of circulating SARS-CoV-2 variants throughout California is tracked by the California Department of Public Health (CDPH). This data is compiled by analysing information from the CDPH Genomic Surveillance Data and its communicable disease reporting system, CalREDIE. As viruses naturally mutate, some variants appear and then vanish, while others become common. Specialised laboratories across the state sequence the genomes from a portion of all positive COVID-19 tests to identify which variants are circulating. This process of sequencing and reporting variant results can take several days after a test is confirmed as positive. Understanding which variants are active in California is crucial for informing public health and clinical responses.

Columns

  • date: The date of the data record.
  • area: The geographic area, which is 'California'.
  • area_type: The type of geographic area, which is 'State'.
  • variant_name: The name of the specific COVID-19 variant (e.g., Alpha).
  • specimens: The total number of specimens identified for a particular variant on a given date.
  • percentage: The percentage of a specific variant among all sequenced specimens for a given date.
  • specimens_7d_avg: The seven-day rolling average of the number of specimens for a variant.
  • percentage_7d_avg: The seven-day rolling average of the percentage for a variant.

Distribution

This dataset is provided as a single tabular CSV file, covid19_variant.csv, with a file size of approximately 417.8 kB. It contains 8 columns and a total of 7,790 records.

Usage

This dataset is ideal for analysing public health trends, conducting epidemiological research, and tracking the evolution of COVID-19 variants over time in California. Researchers can model the spread of different variants, and public health officials can use it to inform policy decisions and resource allocation. It is also suitable for creating data visualisations to communicate variant prevalence to the public.

Coverage

The data covers the entire state of California. The records are provided on a daily basis, with 779 unique dates available in the dataset.

License

CC0: Public Domain

Who Can Use It

  • Public Health Officials: For monitoring variant trends and guiding public health interventions.
  • Epidemiologists and Researchers: To study virus mutation, transmission dynamics, and the effectiveness of health measures.
  • Data Journalists and Scientists: For creating reports and visualisations to inform the public about the evolving pandemic.
  • Policy Makers: To develop informed health policies based on the prevalence of specific variants.

Dataset Name Suggestions

  • California COVID-19 Variant Surveillance
  • SARS-CoV-2 Variant Prevalence in California
  • California Daily COVID-19 Variant Tracker
  • Genomic Surveillance of COVID-19 Variants in California

Attributes

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

24/09/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in CSV Format