Tuscany COVID-19 Daily Province Metrics
Patient Health Records & Digital Health
About
Provides detailed COVID-19 metrics for the Tuscany region of Italy and its provinces. It records key statistics such as the total number of positive people, cumulative deaths, and the daily percentage variation in deaths. The data is structured to allow analysis at the regional level (Tuscany) and at the more granular provincial and medical district levels. It is suited to exploring the progression of the virus over time within this region, and to statistical modelling and visualisation.
Columns
The data file includes several key indicators:
- Location: Specifies the name of the province or medical district within Tuscany (e.g., AR, FI, aslCENTRO). There are 13 unique location values represented.
- day: A sequential numerical identifier for the day of reporting.
- date: The specific date of the record. There are 843 unique dates covered in the available sample.
- total_number_positive_people: The cumulative count of positive cases recorded from the beginning of the reporting period (maximum value observed around 497k).
- deaths: The cumulative number of people who have died since the beginning of the period (maximum value observed around 4936).
- deaths_increase: Represents the daily variation in deaths expressed as a percentage.
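A minimal sketch of loading a file with these columns using pandas. The column names come from the list above; the inline sample rows are invented purely to make the example runnable, and the ISO date format is an assumption, since the format used in the actual file is not stated here.

```python
import io

import pandas as pd

# Invented rows that only mimic the documented column layout; not real figures.
SAMPLE_CSV = """Location,day,date,total_number_positive_people,deaths,deaths_increase
FI,1,2020-02-24,0,0,0
FI,2,2020-02-25,2,0,0
AR,1,2020-02-24,0,0,0
AR,2,2020-02-25,1,0,0
"""


def load_metrics(source):
    """Read the CSV from a path or file-like object and parse the date column."""
    df = pd.read_csv(source, parse_dates=["date"])
    # Sort so that each location's records appear in chronological order.
    return df.sort_values(["Location", "date"]).reset_index(drop=True)


df = load_metrics(io.StringIO(SAMPLE_CSV))
print(df.dtypes)
print(df["Location"].nunique(), "locations,", df["date"].nunique(), "dates")
```

With the real file, a path to the downloaded CSV would be passed in place of the `io.StringIO` object.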
Distribution
The primary data file, typically a CSV, is approximately 448.83 kB in size and contains approximately 11.8 thousand records of provincial metrics. The raw data was processed before publication: columns that contained no information were removed, some numerical columns were converted from floats to integers, and missing values were systematically filled with the integer '0'. Headers were also translated into English to enhance usability.
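The processing steps described above could be sketched roughly as follows. This is not the publisher's actual pipeline: the raw Italian header names (`data`, `deceduti`, `vuota`), the values, and the `clean` helper are all hypothetical and exist only to illustrate the kind of transformation involved.

```python
import numpy as np
import pandas as pd

# Hypothetical raw frame: header names and values are invented for illustration.
raw = pd.DataFrame({
    "data": ["2020-02-24", "2020-02-25", "2020-02-26"],
    "deceduti": [0.0, np.nan, 2.0],
    "vuota": [np.nan, np.nan, np.nan],  # a column that carries no information
})


def clean(frame, header_map, int_cols):
    """Drop empty columns, translate headers, fill gaps with 0, cast to int."""
    out = frame.dropna(axis="columns", how="all")  # remove all-empty columns
    out = out.rename(columns=header_map)           # translate headers
    out = out.fillna(0)                            # fill missing values with 0
    # Cast only the columns named by the caller; others keep their dtype.
    return out.astype({col: "int64" for col in int_cols})


cleaned = clean(raw, {"data": "date", "deceduti": "deaths"}, int_cols=["deaths"])
print(cleaned)
```

One consequence of filling with 0 is that a missing observation can no longer be told apart from a genuine zero, which is worth keeping in mind when interpreting early or sparse records.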
Usage
This dataset is highly suitable for several analytical applications:
- Exploratory Data Analysis: Ideal for initial investigation using data manipulation libraries such as pandas or NumPy.
- Visualisation Projects: Supports the creation of charts and graphs that illustrate trends in cumulative positive cases and deaths over time.
- Time Series Analysis: The daily records allow temporal analysis, such as modelling trends on a date-indexed series.
- Query Practice: An excellent resource for users practising SQL or advanced data filtering techniques using Pandas.
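As an illustration of the filtering and time series uses listed above, the sketch below derives daily new deaths from the cumulative `deaths` column and smooths them for one province. The figures are invented, `new_deaths` is a helper column introduced here, and the 3-day window is an arbitrary choice for the example.

```python
import pandas as pd

# Invented figures that follow the documented columns; not real data.
dates = pd.to_datetime(["2020-03-01", "2020-03-02", "2020-03-03", "2020-03-04"])
df = pd.DataFrame({
    "Location": ["FI"] * 4 + ["AR"] * 4,
    "date": list(dates) * 2,
    "deaths": [0, 1, 3, 3, 0, 0, 1, 2],  # cumulative counts per location
})

# Daily new deaths: difference of the cumulative count within each location.
df = df.sort_values(["Location", "date"])
df["new_deaths"] = (
    df.groupby("Location")["deaths"].diff().fillna(0).astype("int64")
)

# Filter one province and compute a short rolling mean over its daily values.
fi = df.query("Location == 'FI'").set_index("date")
rolling = fi["new_deaths"].rolling(window=3, min_periods=1).mean()
print(fi[["deaths", "new_deaths"]])
print(rolling)
```

The same filter can be written in SQL as a `WHERE Location = 'FI'` clause once the file is loaded into a database table.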
Coverage
The data focuses exclusively on Tuscany, Italy.
- Geographic Scope: Includes metrics for the entire Tuscany region, its ten provinces (AR, FI, GR, LI, LU, MS, PI, PO, PT, SI), and specific medical districts (aslCENTRO, aslNO, aslSE).
- Time Range: The records span from 24 February 2020 to 15 June 2022.
- Update Frequency: No future updates are expected for this dataset.
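The coverage details above can be turned into simple checks. The sketch below labels each location code by administrative level using the codes listed above, and counts the calendar days in the stated time range. The `location_level` and `days_covered` helpers are hypothetical names; the label used for the regional total is not given here, so any unlisted code falls into an "other" bucket.

```python
from datetime import date

# Codes taken from the geographic scope listed above.
PROVINCES = {"AR", "FI", "GR", "LI", "LU", "MS", "PI", "PO", "PT", "SI"}
MEDICAL_DISTRICTS = {"aslCENTRO", "aslNO", "aslSE"}


def location_level(code):
    """Return the administrative level for a Location code."""
    if code in PROVINCES:
        return "province"
    if code in MEDICAL_DISTRICTS:
        return "medical_district"
    # Anything else, e.g. the regional total, whose label is not stated here.
    return "other"


def days_covered(first, last):
    """Number of calendar days from first to last, both inclusive."""
    return (last - first).days + 1


span = days_covered(date(2020, 2, 24), date(2022, 6, 15))
print(location_level("FI"), location_level("aslNO"), span)
```

The inclusive day count is 843, matching the number of unique dates reported in the column description. Because the regional and medical district rows presumably aggregate the provinces, summing a metric across all locations would likely count the same cases more than once; filtering to a single level first avoids that.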
License
Attribution 4.0 International (CC BY 4.0)
Who Can Use It
This data is intended for a variety of users:
- Data Scientists: To build predictive models or perform rigorous statistical tests on public health data.
- Students and Educators: For teaching data wrangling, cleaning techniques, visualisation methods, and database query languages.
- Public Health Researchers: To track the daily impact and historical patterns of the pandemic across distinct administrative areas within Tuscany.
Dataset Name Suggestions
- Tuscany COVID-19 Daily Province Metrics
- Italy Tuscany Region Pandemic Data (2020-2022)
- Tuscan COVID-19 Provincial Health Statistics
Attributes
Original Data Source: Tuscany COVID-19 Daily Province Metrics
