Opendatabay APP

Geolocated Daily European Coronavirus Data

Patient Health Records & Digital Health

Tags and Keywords

Covid

Europe

Cases

Latitude

Incomplete

Free

About

This tabular data product records daily Covid-19 case counts over time for geographic locations across Europe. The collection is classified as incomplete: it does not cover the whole continent, and it identifies each location by geographical coordinates (latitude and longitude) rather than by city name. The time series is suited to analysis covering 2020 through mid-2022.

Columns

The data file contains 13 distinct columns:
  • index: A sequential number.
  • date: The date of the report.
  • cases: The reported number of cases (maximum observed value is approximately 34,900).
  • country: The country associated with the record (12 unique values observed).
  • qry: A unique identifier for each geographical location.
  • lat: The latitude coordinate for the location (ranging from approximately -21.1 to 60.2).
  • long: The longitude coordinate for the location (ranging from approximately -63.1 to 55.5).
  • dayofyear: The ordinal day of the year, sometimes loosely called the Julian day (ranging from 1 to 366).
  • year: The year of the record (from 2020 to 2022).
  • lengthofday: The approximate length of the day in hours.
  • delta: The difference in day length between consecutive days.
  • delta2: The second difference of day length, i.e. the change in delta between consecutive days.
  • normilized_cases: Case count normalized by the maximum number of cases per year (ranging from 0 to 1).
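
The listing does not spell out how delta, delta2 and normilized_cases were computed. The following is a minimal sketch of one plausible reading, assuming the differences are taken per location (qry) in date order and that normalization uses the yearly maximum within each location. The function name and the `*_check` column names are hypothetical; they are chosen so that recomputed values can be compared against the published columns without overwriting them.

```python
import pandas as pd


def add_derived_columns(df: pd.DataFrame) -> pd.DataFrame:
    """Recompute day-length differences and normalized cases (assumed definitions)."""
    out = df.sort_values(["qry", "date"]).copy()

    # First difference of day length within each location.
    out["delta_check"] = out.groupby("qry")["lengthofday"].diff()

    # Second difference: the change in delta between consecutive days.
    out["delta2_check"] = out.groupby("qry")["delta_check"].diff()

    # Assumption: normalize by the yearly maximum within each location.
    yearly_max = out.groupby(["qry", "year"])["cases"].transform("max")
    ratio = out["cases"] / yearly_max
    # Where the yearly maximum is zero, report 0 instead of NaN or inf.
    out["normalized_check"] = ratio.where(yearly_max > 0, 0.0)
    return out


# Tiny synthetic example; these values are not taken from europe.csv.
sample = pd.DataFrame(
    {
        "qry": ["a", "a", "a"],
        "date": pd.to_datetime(["2020-01-01", "2020-01-02", "2020-01-03"]),
        "year": [2020, 2020, 2020],
        "lengthofday": [8.0, 8.5, 9.5],
        "cases": [0, 5, 10],
    }
)
result = add_derived_columns(sample)
print(result[["delta_check", "delta2_check", "normalized_check"]])
```

If the recomputed columns disagree with the published ones, the normalization may instead use a single yearly maximum across all locations; the listing's wording allows either reading.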

Distribution

The primary data file, named europe.csv, is provided in a standard tabular format suitable for time series analysis. The file size is 272.52 MB and contains over 1.43 million validated records across all columns.
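
At roughly 270 MB, the file may be easier to inspect in chunks than in a single load. The sketch below uses the `chunksize` option of pandas `read_csv` and assumes the CSV header uses the column names listed above; the function name `summarise_in_chunks` is hypothetical, and the in-memory sample stands in for europe.csv.

```python
import io

import pandas as pd


def summarise_in_chunks(source, chunksize: int = 200_000) -> dict:
    """Count rows, track the largest case count and count countries, chunk by chunk."""
    rows = 0
    max_cases = None
    countries = set()
    reader = pd.read_csv(
        source, usecols=["date", "cases", "country"], chunksize=chunksize
    )
    for chunk in reader:
        rows += len(chunk)
        chunk_max = chunk["cases"].max()
        max_cases = chunk_max if max_cases is None else max(max_cases, chunk_max)
        countries.update(chunk["country"].dropna().unique())
    return {"rows": rows, "max_cases": max_cases, "countries": len(countries)}


# Synthetic stand-in for europe.csv; pass the real file path instead.
sample_csv = io.StringIO(
    "date,cases,country\n"
    "2020-01-01,3,France\n"
    "2020-01-02,7,Spain\n"
    "2020-01-03,5,France\n"
)
summary = summarise_in_chunks(sample_csv, chunksize=2)
print(summary)
```

Against the real file, the row count, maximum case count and number of countries can be compared with the figures quoted in this listing.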

Usage

This dataset is valuable for applications such as:
  • Time Series Analysis: Modelling the progression of Covid-19 case counts over specific periods.
  • Geospatial Studies: Investigating potential correlations between case distribution and geographical factors like latitude and longitude.
  • Environmental Correlation: Exploring the relationship between epidemiological data and temporal/environmental variables, such as day length.
  • Historical Tracking: Reviewing the regional impact of the pandemic across parts of Europe from 2020 onwards.
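
As an illustration of the environmental correlation use case, the sketch below computes a Pearson correlation between day length and normalized cases for each location. It assumes the column names listed above (including the spelling normilized_cases); the function name is hypothetical and the sample values are synthetic. A correlation of this kind is only a starting point for exploration and does not by itself indicate a causal link.

```python
import pandas as pd


def daylength_case_correlation(df: pd.DataFrame) -> pd.Series:
    """Pearson correlation of day length vs normalized cases, per location (qry)."""
    correlations = {
        qry: group["lengthofday"].corr(group["normilized_cases"])
        for qry, group in df.groupby("qry")
    }
    return pd.Series(correlations, name="pearson_r")


# Synthetic example: location A falls as days lengthen, location B rises.
sample = pd.DataFrame(
    {
        "qry": ["A", "A", "A", "B", "B", "B"],
        "lengthofday": [8.0, 9.0, 10.0, 8.0, 9.0, 10.0],
        "normilized_cases": [1.0, 0.5, 0.0, 0.0, 0.5, 1.0],
    }
)
r = daylength_case_correlation(sample)
print(r)
```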

Coverage

The data covers various locations within Europe and is explicitly noted as an incomplete representation of the continent. City names have been replaced by geographic coordinates, and each location is assigned a unique identifier (qry). The temporal coverage spans from 1 January 2020 to 2 June 2022. Twelve countries are included; the United Kingdom is the most frequently recorded, accounting for 23% of the records.
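
The coverage figures above can be checked once the file is loaded. This is a minimal sketch, assuming the date column parses with `pandas.to_datetime` and that the columns are named as listed; `coverage_summary` is a hypothetical helper and the sample rows are synthetic.

```python
import pandas as pd


def coverage_summary(df: pd.DataFrame) -> dict:
    """Report the date range, the number of countries and the largest country share."""
    dates = pd.to_datetime(df["date"])
    shares = df["country"].value_counts(normalize=True)
    return {
        "first_date": dates.min().date().isoformat(),
        "last_date": dates.max().date().isoformat(),
        "n_countries": int(df["country"].nunique()),
        "top_country": shares.index[0],
        "top_share": float(shares.iloc[0]),
    }


# Synthetic rows; these are not taken from europe.csv.
sample = pd.DataFrame(
    {
        "date": ["2020-01-01", "2020-01-02", "2020-01-03", "2020-01-04"],
        "country": ["United Kingdom", "United Kingdom", "France", "United Kingdom"],
    }
)
coverage = coverage_summary(sample)
print(coverage)
```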

License

Attribution 4.0 International (CC BY 4.0)

Who Can Use It

Intended users include data scientists, academic researchers in public health or epidemiology, and geospatial analysts. They can use the data for predictive modelling, exploratory data analysis of pandemic patterns, and understanding the geographical and temporal distribution of disease spread.

Dataset Name Suggestions

  • European COVID-19 Case Time Series (2020–2022)
  • Geolocated Daily European Coronavirus Data
  • European Case Metrics with Day Length Attributes

Attributes

Listing Stats

  • Views: 3
  • Downloads: 0
  • Listed: 02/11/2025
  • Region: Global
  • Universal Data Quality Score (UDQS): 5 / 5
  • Version: 1.0
