News classification
Reviews & Ratings
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset contains news headlines along with their corresponding categories. It is designed to classify news articles into different topics such as Business, Sports, Sci/Tech, and World. The dataset offers a useful resource for training machine learning models for text classification and natural language processing tasks, with a focus on the diversity of news genres and subjects.
Dataset Features:
- N_ID: Unique identifier number for news.
- text: The headline of the news article, which concisely summarises the content.
- label: The category or genre of the news article, which could be Business, Sci/Tech, Sports, or World.
Usage:
This dataset is ideal for training and testing machine learning models in the following areas:
- Text classification: Categorising news articles based on the content of the headlines.
- Sentiment analysis: Understanding the sentiment conveyed in different types of news topics.
- Topic modelling: Discovering the common themes in a corpus of news articles.
- Predicting article popularity or readership based on category and content.
Coverage:
The dataset includes news headlines spanning various global topics, providing a broad spectrum of genres. The articles are categorised into Business, Sci/Tech, Sports, and World, which allows for a variety of text mining and natural language processing techniques.
License:
CC0 (Public Domain)
Who can use it:
This dataset is intended for data scientists, machine learning practitioners, researchers, and students interested in exploring text classification, natural language processing, and news analytics.
How to use it:
- Develop models for automated news article categorisation.
- Analyse patterns in different types of news coverage across categories.
- Explore the relationship between article content and its assigned topic label.