BBC News Headlines Dataset
News & Media Articles
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This continually updated dataset provides BBC News RSS Feeds, offering a rich source of information for various analytical purposes. It is specifically designed for users interested in sentiment analysis of news headlines and descriptions, and for tracking news content over time. The dataset is generated through a Kernel that regularly collects and updates RSS feeds from the BBC News site.
Columns
- title: The headline or title of the RSS Feed. This column contains 39,653 unique titles out of 42,100 valid entries. A frequent title is "Election poll tracker: How do the parties compare?".
- pubDate: The publication date of the RSS Feed. This is a datetime column, with dates ranging from 30 August 2013 to 4 December 2024. The mean publication date is 30 July 2023, and there are 42,100 valid entries.
- guid: The unique identifier for the RSS Feed. There are 39,203 unique GUIDs out of 42,100 valid entries. A common GUID is "https://www.bbc.co.uk/news/business-61634959".
- link: The direct URL to the news article. This column has 37,856 unique links out of 42,100 valid entries. A frequently occurring link is "https://www.bbc.co.uk/news/business-61634959?at_medium=RSS&at_campaign=KARANGA".
- description: A brief summary or description of the news content. There are 38,731 unique descriptions out of 42,100 valid entries. A common description is "How closely have you been paying attention to what's been going on over the past seven days?".
Distribution
The dataset is provided in CSV format and is approximately 13.54 MB in size. It contains 42,100 records, with all columns having a full set of valid entries.
Usage
This dataset is ideal for:
- Sentiment analysis of news articles using titles and descriptions.
- Trend analysis of news topics over time.
- Monitoring news publication patterns.
- Natural Language Processing (NLP) research and model training.
Coverage
The dataset covers BBC News RSS feeds from 30 August 2013 to 4 December 2024. The content generally reflects global news as reported by the BBC.
License
CC0: Public Domain
Who Can Use It
- Data scientists and analysts for media sentiment research.
- Researchers studying journalism trends and news dissemination.
- Developers creating news aggregation or analysis tools.
- Students undertaking academic projects in text analysis or media studies.
Dataset Name Suggestions
- BBC News RSS Feed Data
- BBC News Headlines Dataset
- Daily BBC News Feed
- Global News RSS Collection
Attributes
Original Data Source: BBC News Headlines Dataset