Quarter-Century Irish Times News
News & Media Articles
Free
About
This dataset contains 1.61 million headlines published by the Irish Times, giving a quarter-century perspective on European events. Spanning 25.5 years, it covers news, business, sport, culture, lifestyle, and opinion as reported by a newspaper with over 160 years of history. The headlines are presented in their original publishing order and need minimal processing, thanks to their clean format and category structure.
Columns
- publish_date: The date when the article was published, formatted as yyyymmdd. This column contains 1,611,495 valid values.
- headline_category: Represents the topic or facet of the article. Values are in ASCII, dot-delimited, and lowercase. There are 103 unique categories, with 'news' being the most common at 36%, followed by 'sport' at 10%, and other categories making up 54%. This column has 1,611,495 valid values.
- headline_text: The actual title of the article, presented in English using UTF-8 character encoding. This column contains 1,611,495 valid values, with 1,516,555 unique headlines and 7 missing values.
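As a rough illustration of the column formats described above, the sketch below parses the yyyymmdd date and splits the dot-delimited category using only the Python standard library. The sample rows are made up for the example and are not taken from the dataset, and the header row and column order are assumptions. The helper name parse_rows is hypothetical.

```python
import csv
import io
from datetime import datetime

# Made-up sample rows in the documented column layout (not real records).
# The presence and order of the header row are assumptions.
SAMPLE_CSV = """publish_date,headline_category,headline_text
19960102,news,Example headline one
19960102,sport.soccer,Example headline two
20210630,business.economy,Example headline three
"""


def parse_rows(handle):
    """Yield (date, category_parts, headline) tuples from a CSV file object."""
    for row in csv.DictReader(handle):
        # publish_date is documented as yyyymmdd.
        published = datetime.strptime(row["publish_date"], "%Y%m%d").date()
        # headline_category is documented as lowercase and dot-delimited.
        parts = row["headline_category"].split(".")
        yield published, parts, row["headline_text"]


rows = list(parse_rows(io.StringIO(SAMPLE_CSV)))
for published, parts, headline in rows:
    print(published.isoformat(), parts, headline)
```

To read the real file, the same function could be given open("ireland-news-headlines.csv", newline="", encoding="utf-8") in place of the in-memory sample, assuming the file carries a matching header row.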
Distribution
The dataset is provided as a CSV file, named ireland-news-headlines.csv, with a size of 105.99 MB. It contains 1,611,495 records (rows). A separate, additional dataset comprising fifteen months of observational data from Nigeria is also included.
Usage
This dataset is ideal for:
- Historical research into European events and news trends over a significant period.
- Natural Language Processing (NLP) tasks, such as text analysis, topic modelling, and sentiment analysis on news headlines.
- Media studies focusing on headline categorisation and changes in reporting over time.
- Linguistic analysis of headline language and evolution.
- Journalism research and analysis of news coverage patterns.
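For the trend and categorisation uses listed above, a first pass might count headlines per year and measure the share of each category. The sketch below is one possible approach, not a prescribed method. The records are made up, and grouping by the first dot-delimited segment is an assumption for illustration (the published 36% figure for 'news' may be computed differently). The function names are hypothetical.

```python
from collections import Counter

# Made-up (publish_date, headline_category) pairs, not real records.
records = [
    ("19960102", "news"),
    ("19960315", "sport.soccer"),
    ("20200101", "news.politics"),
    ("20210630", "news"),
]


def top_level_share(pairs):
    """Return the fraction of headlines per first category segment."""
    counts = Counter(category.split(".")[0] for _, category in pairs)
    total = sum(counts.values())
    return {name: count / total for name, count in counts.items()}


def headlines_per_year(pairs):
    """Count headlines by the year encoded in the yyyymmdd date string."""
    return Counter(int(publish_date[:4]) for publish_date, _ in pairs)


shares = top_level_share(records)
per_year = headlines_per_year(records)
print(shares)
print(sorted(per_year.items()))
```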
Coverage
The dataset covers a time range from 1st January 1996 to 30th June 2021. Geographically, it focuses primarily on European events as reported by the Irish Times. The core dataset does not include specific demographic scope notes; however, a bonus dataset offers fifteen months of observational data from Nigeria.
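The stated time range can be used as a simple sanity check when loading the data. The sketch below, under the assumption that publish_date is read as an eight-digit string, tests whether a value falls inside the documented range and recomputes the approximate span in years. The names in_coverage, COVERAGE_START, and COVERAGE_END are hypothetical.

```python
from datetime import date, datetime

# Coverage bounds as stated in the listing.
COVERAGE_START = date(1996, 1, 1)
COVERAGE_END = date(2021, 6, 30)


def in_coverage(publish_date):
    """Return True if a yyyymmdd string is a valid date inside the range."""
    # Require exactly eight digits, since strptime alone accepts
    # some shorter strings.
    if len(publish_date) != 8 or not publish_date.isdigit():
        return False
    try:
        parsed = datetime.strptime(publish_date, "%Y%m%d").date()
    except ValueError:
        return False
    return COVERAGE_START <= parsed <= COVERAGE_END


# Approximate span in years, using an average year length of 365.25 days.
span_years = (COVERAGE_END - COVERAGE_START).days / 365.25
print(round(span_years, 1))
```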
License
CC0: Public Domain
Who Can Use It
- Researchers and Academics: For historical, political science, media studies, or linguistic research.
- Data Scientists and NLP Practitioners: For developing and testing text analysis models.
- Journalists and Media Analysts: To understand long-term reporting trends and category distribution.
- Historians: To gain insight into news coverage of European events across a quarter-century.
Dataset Name Suggestions
- Irish Times European Headlines Archive
- Quarter-Century Irish Times News
- European Headline Chronicle 1996-2021
- Irish Times Historical News Dataset
- European News Headlines by Irish Times
Attributes
Original Data Source: Quarter-Century Irish Times News