French News on Coronavirus
Public Safety & Security
Free
About
This dataset provides a collection of approximately 6,000 articles about COVID-19, sourced from 69 French-speaking news websites. It supports studying the media's influence during the pandemic, analysing diverse writing styles, performing sentiment analysis, and training news generation models.
Columns
- date_publish: The date when the article was published. Please note that some publication dates may not be entirely accurate.
- title: The headline or title of the news article.
- description: A brief summary or short description of the article's content.
- maintext: The primary body content of the article. Be aware that some content might be truncated due to subscription requirements on the original websites.
- url: The direct web address to the original article.
- labels: Categories or tags assigned to the article, indicating its subject matter.
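As a minimal sketch of how these columns might be read, the snippet below parses a small in-memory CSV with Python's standard library. The sample rows are invented for illustration, and the timestamp format `%Y-%m-%d %H:%M:%S` is an assumption, not something the dataset description confirms. Dates that fail to parse become `None`, and articles with an empty `maintext` are flagged, which reflects the caveats noted for `date_publish` and `maintext`.

```python
import csv
import io
from datetime import datetime

EXPECTED_COLUMNS = ["date_publish", "title", "description", "maintext", "url", "labels"]

# Illustrative rows only; they are not taken from the dataset.
SAMPLE_CSV = """date_publish,title,description,maintext,url,labels
2020-03-17 08:30:00,Titre A,Resume A,Texte complet de l'article A.,https://example.org/a,sante
,Titre B,Resume B,,https://example.com/b,economie
"""


def parse_date(value):
    """Return a datetime, or None when the value is empty or not in the assumed format."""
    try:
        return datetime.strptime(value, "%Y-%m-%d %H:%M:%S")  # assumed format
    except ValueError:
        return None


def load_articles(handle):
    """Read articles from an open CSV handle and normalise the caveated fields."""
    reader = csv.DictReader(handle)
    missing = [c for c in EXPECTED_COLUMNS if c not in (reader.fieldnames or [])]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    articles = []
    for row in reader:
        row["date_publish"] = parse_date(row["date_publish"])
        # An empty body may indicate paywalled content on the original site.
        row["has_maintext"] = bool(row["maintext"].strip())
        articles.append(row)
    return articles


articles = load_articles(io.StringIO(SAMPLE_CSV))
```

For a real file, the same function could be called on a handle opened with `open(path, newline="", encoding="utf-8")`, where `path` is wherever the CSV was saved.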
Distribution
The dataset is typically provided as a CSV file containing approximately 6,000 articles. An exact record count is not stated.
Usage
This dataset is ideal for a variety of analytical and research applications, including:
- Studying the media's impact concerning COVID-19.
- Analysing different writing styles present in news reporting.
- Conducting sentiment analysis on news content.
- Developing and training models for news generation.
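To make the sentiment analysis use case concrete, here is a deliberately simple lexicon-based scorer. The word lists are a hypothetical toy lexicon chosen for illustration; a real study would more likely use an established French sentiment resource or a trained model. The score is the number of positive tokens minus the number of negative tokens, divided by the total token count.

```python
import re

# Hypothetical toy lexicon, for illustration only.
POSITIVE = {"espoir", "soulagement", "reprise"}
NEGATIVE = {"crise", "panique", "danger"}


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def sentiment_score(text):
    """Return (positive - negative) / total tokens, or 0.0 for empty text."""
    tokens = tokenize(text)
    if not tokens:
        return 0.0
    pos = sum(t in POSITIVE for t in tokens)
    neg = sum(t in NEGATIVE for t in tokens)
    return (pos - neg) / len(tokens)


score = sentiment_score("La crise provoque la panique")
```

Applied to the `maintext` column, a scorer like this gives one number per article, which can then be aggregated by date or source. Truncated articles would be scored on whatever text is available, so they may be worth excluding or handling separately.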
Coverage
The articles come from French-speaking news websites, including major French, Belgian, Canadian, Swiss, and Moroccan outlets, so coverage extends across French-speaking regions rather than France alone. All content centres on COVID-19. Each article carries a publication date, although some dates may be inaccurate, and parts of the main text may be missing where the original article sits behind a paywall.
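One way to inspect source coverage is to group articles by the domain in the `url` column. The sketch below uses only the standard library; the URLs are placeholders rather than real dataset entries, and stripping a leading `www.` is a simplifying assumption about how outlets should be merged.

```python
from collections import Counter
from urllib.parse import urlparse


def source_domain(url):
    """Return the lowercased host of a URL, without a leading 'www.'."""
    host = urlparse(url).hostname or ""
    return host[4:] if host.startswith("www.") else host


# Placeholder URLs, not taken from the dataset.
urls = [
    "https://www.example.fr/covid/a",
    "https://example.fr/covid/b",
    "https://news.example.be/c",
]

counts = Counter(source_domain(u) for u in urls)
```

On the full dataset, `len(counts)` would give the number of distinct domains, which can be compared against the 69 websites mentioned in the description.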
License
CC0
Who Can Use It
This dataset is suitable for a broad range of users, including:
- Researchers: For academic studies on media, public health, and linguistics.
- Data Scientists: For developing and testing natural language processing (NLP) models, sentiment analysis tools, and text generation algorithms.
- Journalists and Media Analysts: To understand reporting trends and media narratives during the pandemic.
- Public Health Organisations: For insights into public perception and information dissemination during health crises.
Dataset Name Suggestions
- COVID-19 French News Articles
- French-Speaking Pandemic Press
- COVID-19 Media Impact (France)
- French News on Coronavirus
- Francophone COVID-19 Data
Attributes
Original Data Source: COVID-19 - French news dataset