BBC Hindi News Articles Dataset - Detailed
Entertainment & Media Consumption
Free
About
The BBC Hindi News Articles Dataset is a collection of BBC Hindi news articles gathered with a Python web-scraping script. The articles span several categories, giving a broad range of content for analysis. Each entry includes three fields:
Headline: The title of the news article.
Content: The full text of the article.
Category: The category to which the article belongs.
The dataset suits natural language processing (NLP) tasks such as sentiment analysis and language modeling, and it is a useful resource for studying Hindi news media.
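As a rough illustration of working with the three fields described above, here is a minimal sketch that reads the articles and counts them per category. It assumes the data ships as a CSV file whose header uses the column names `Headline`, `Content`, and `Category`; those names, the helper functions, and the tiny in-memory sample are assumptions made for the example, not details confirmed by the dataset.

```python
import csv
import io
from collections import Counter

# Assumed column names, inferred from the three fields described above;
# the actual CSV header may differ.
FIELDS = ("Headline", "Content", "Category")


def load_articles(handle):
    """Read articles from an open CSV file handle into a list of dicts."""
    reader = csv.DictReader(handle)
    missing = [f for f in FIELDS if f not in (reader.fieldnames or [])]
    if missing:
        raise ValueError(f"missing expected columns: {missing}")
    return [{f: row[f] for f in FIELDS} for row in reader]


def category_counts(articles):
    """Count how many articles fall into each category."""
    return Counter(a["Category"] for a in articles)


# Tiny made-up sample standing in for the real file.
sample = io.StringIO(
    "Headline,Content,Category\n"
    "शीर्षक एक,लेख का पाठ एक,india\n"
    "शीर्षक दो,लेख का पाठ दो,sport\n"
    "शीर्षक तीन,लेख का पाठ तीन,india\n"
)
articles = load_articles(sample)
counts = category_counts(articles)
```

To run this against the downloaded file, pass `load_articles` a handle opened with `encoding="utf-8"` and `newline=""` so the Devanagari text and any multi-line article bodies are read correctly.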
I could not find any such dataset under a Creative Commons license, so I scraped the data myself and made it available on Kaggle!
Please use it freely; just credit the dataset. An upvote would be really appreciated :)
I have also uploaded my Jupyter notebook for the web scraping to GitHub, if you want to check it out:
https://github.com/AadiSrivastava05/BBC-Hindi-News-Dataset-with-web-scraping-script
Original Data Source: BBC Hindi News Articles Dataset - Detailed