Corona News Analysis Data

Public Safety & Security

Tags and Keywords

Education, News, Programming, NLP, Deep Learning, Public Safety, Coronavirus, Feature Engineering

Free

About

This dataset provides news content related to the 2019–20 coronavirus outbreak, which the World Health Organization (WHO) declared a Public Health Emergency of International Concern (PHEIC) and later a pandemic. Coronaviruses are a large family of viruses. A novel coronavirus, SARS-CoV-2, was identified in Wuhan, China in 2019; the disease it causes, COVID-19, spread globally. The dataset is intended to help data scientists gain insight into the spread of COVID-19 around the world through news analysis.

Columns

  • date: Specifies the publication date of the news item.
  • title: Represents the headline or title of the news post.
  • category: Indicates the classification or subject category of the post.
  • body: Contains the main content or text of the news article.
  • source: Identifies the origin or provider from which the news data was collected.
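As a minimal sketch of how the five columns above might be read and checked, the following uses only Python's standard library. The sample text, the helper name `load_news`, and the date strings are assumptions made for illustration; the listing does not show the file's actual contents or date format.

```python
import csv
import io

# Column names as listed in the Columns section.
EXPECTED_COLUMNS = ["date", "title", "category", "body", "source"]

# Made-up sample standing in for the real file; values are illustrative only.
SAMPLE_CSV = """date,title,category,body,source
March 12,Example headline A,Business,Example body text A.,By Reuters
March 17,Example headline B,U.S.,Example body text B.,By The Associated Press
"""


def load_news(handle):
    """Read news rows from an open CSV handle and check the expected columns."""
    reader = csv.DictReader(handle)
    missing = [c for c in EXPECTED_COLUMNS if c not in (reader.fieldnames or [])]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    return list(reader)


rows = load_news(io.StringIO(SAMPLE_CSV))
print(len(rows), "rows; first title:", rows[0]["title"])
```

For the downloaded file, the same function could be called on a handle opened with `open(path, newline="", encoding="utf-8")`, where `path` is wherever the CSV was saved and the encoding is an assumption.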

Distribution

The dataset is a CSV file with 1009 rows. By date, approximately 11% of the items were published on March 17 and 10% on March 12. By category, 19% of the items fall under "Business" and 19% under "U.S.". By source, 48% of the content is attributed "By Reuters" and 34% "By The Associated Press". The listing also reports 954 unique date values and 942 unique category values; these counts are difficult to reconcile with the shares above (two dates alone account for about 21% of 1009 rows), so they are best checked against the file itself.
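The shares and unique-value counts quoted above could be recomputed from the file with a short helper along these lines. This is a sketch under assumptions: `value_shares` is a hypothetical name, and the sample rows are invented and do not reproduce the dataset's figures.

```python
from collections import Counter


def value_shares(rows, column):
    """Return (value, share) pairs for one column, most frequent first."""
    counts = Counter(row[column] for row in rows)
    total = sum(counts.values())
    return [(value, count / total) for value, count in counts.most_common()]


# Made-up rows for illustration; they do not reproduce the dataset's figures.
sample_rows = [
    {"source": "By Reuters"},
    {"source": "By Reuters"},
    {"source": "By The Associated Press"},
    {"source": "Other"},
]

shares = value_shares(sample_rows, "source")
for value, share in shares:
    print(f"{value}: {share:.0%}")

# The number of distinct values in the column.
unique_sources = len(shares)
```

Running the same helper on the `date` and `category` columns of the real rows would show whether the unique-value counts in the listing hold.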

Usage

This dataset is ideal for a variety of applications, including:
  • Analysing the global dissemination of news related to COVID-19.
  • Natural Language Processing (NLP) tasks such as text classification, sentiment analysis, or topic modelling on news content.
  • Applying deep learning techniques for insights into public discourse and information spread during a health crisis.
  • Feature engineering for broader data science projects concerning pandemic communication.
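To make the NLP and feature-engineering uses above concrete, here is a minimal standard-library sketch that tokenizes an article and derives a few simple features. The tokenization rule, the keyword list, and the function names are assumptions chosen for illustration, not part of the dataset; a real project would likely use a dedicated NLP library.

```python
import re
from collections import Counter

# Simple word pattern that keeps hyphenated terms such as "covid-19" together.
TOKEN_PATTERN = re.compile(r"[a-z0-9]+(?:[-'][a-z0-9]+)*")


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return TOKEN_PATTERN.findall(text.lower())


def article_features(row, keywords=("coronavirus", "covid-19", "pandemic")):
    """Build a small feature dict from one row's title and body."""
    tokens = tokenize(row["title"] + " " + row["body"])
    counts = Counter(tokens)
    features = {"n_tokens": len(tokens), "n_unique_tokens": len(counts)}
    for keyword in keywords:
        features[f"count_{keyword}"] = counts[keyword]
    return features


# Made-up row for illustration only.
example = {
    "title": "Coronavirus response widens",
    "body": "Officials said the COVID-19 pandemic response would widen. "
            "The pandemic continues.",
}
print(article_features(example))
```

Feature dicts like these could feed a text classifier that predicts the `category` column, or be aggregated by `date` to track how often a term appears over time.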

Coverage

The dataset's geographic scope is global, reflecting the worldwide spread of the COVID-19 pandemic. The listing does not state an exact time range: it names March 12 and March 17 as the most frequent publication dates, alongside many other dates, and places the coverage within the 2019–2020 coronavirus period.

License

CC0

Who Can Use It

This dataset is primarily intended for data scientists who wish to gain insights into the spread of COVID-19 through news media. It is also suitable for researchers, public safety analysts, and anyone involved in NLP or deep learning projects focused on news content or health emergencies.

Dataset Name Suggestions

  • Coronavirus News (COVID-19)
  • Global Pandemic News Content
  • COVID-19 News Articles Dataset
  • Corona News Analysis Data

Attributes

Original Data Source: Coronavirus News (COVID-19)

Listing Stats

  • Views: 0
  • Downloads: 0
  • Listed: 27/06/2025
  • Region: Global
  • UDQS quality score: 5 / 5
  • Version: 1.0
