Opendatabay APP

Topic_classification_dataset

Data Science and Analytics

Tags and Keywords

news

text

classification

nlp

multiclass

Free

About

I built this dataset by combining other datasets to make topic classification easier to work with. It contains 6 topics: Politics, Health, Emotion, Financial, Sport, and Science. The content under each topic consists of news, articles, answers, or comments.
  1. The file "topic_classification_data.csv" contains the original text data.
  2. The file "2CLEAN" contains the same text data with NLP processing applied.
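As a quick sanity check after downloading, it can help to count how many rows fall under each topic. The sketch below is only illustrative: the listing does not document the CSV's column names, so `label_column="label"` and the helper name `count_topics` are assumptions, and the in-memory sample rows are made up to stand in for the real file.

```python
import csv
import io
from collections import Counter


def count_topics(csv_file, label_column="label"):
    """Count rows per topic. `label_column` is a hypothetical column name."""
    reader = csv.DictReader(csv_file)
    if label_column not in (reader.fieldnames or []):
        raise KeyError(
            f"column {label_column!r} not found; available: {reader.fieldnames}"
        )
    return Counter(row[label_column] for row in reader)


# Tiny in-memory stand-in for the real file, with made-up rows.
sample = io.StringIO(
    "text,label\n"
    "The vote was close,Politics\n"
    "The team won,Sport\n"
    "Polls open today,Politics\n"
)
counts = count_topics(sample)
print(counts.most_common())
```

With the real data, the same function could be called on `open("topic_classification_data.csv", newline="", encoding="utf-8")`, assuming the file is UTF-8 encoded and the topic column name is adjusted to match the actual header.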
The NLP processing steps are:
  1. Text cleaning:
     - Normalize the text.
     - Remove punctuation marks.
     - Remove stop words.
     - Remove HTML tags.
     - Remove special characters.
     - Remove emojis.
     - Fix contractions.
  2. POS tagging
  3. Lemmatization
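The text-cleaning step listed above can be sketched with the Python standard library alone. This is a minimal illustration, not the procedure actually used to produce the cleaned file: the stop-word list, the contraction map, the emoji ranges, and the order of operations are all assumptions, and `clean_text` is a hypothetical name. POS tagging and lemmatization are left out because they need an NLP library such as NLTK or spaCy.

```python
import html
import re
import string
import unicodedata

# Illustrative subsets only; the lists used for the dataset are not documented.
STOP_WORDS = {"a", "an", "the", "is", "are", "was", "to", "of", "and", "in", "it"}
CONTRACTIONS = {"don't": "do not", "can't": "cannot", "it's": "it is", "i'm": "i am"}

HTML_TAG_RE = re.compile(r"<[^>]+>")
CONTRACTION_RE = re.compile(
    r"\b(" + "|".join(re.escape(c) for c in CONTRACTIONS) + r")\b"
)
# Common emoji and pictograph code-point ranges (not exhaustive).
EMOJI_RE = re.compile(
    "[\U0001F300-\U0001FAFF\U00002600-\U000027BF\U0001F1E6-\U0001F1FF\uFE0F]"
)
PUNCT_TABLE = str.maketrans({ch: " " for ch in string.punctuation})
NON_ALNUM_RE = re.compile(r"[^a-z0-9\s]")


def clean_text(text: str) -> str:
    # Normalize: unify Unicode forms, straighten curly apostrophes, lowercase.
    text = unicodedata.normalize("NFKC", text).replace("\u2019", "'").lower()
    # Remove HTML tags, then decode entities such as &amp;.
    text = html.unescape(HTML_TAG_RE.sub(" ", text))
    # Fix contractions before punctuation removal strips the apostrophes.
    text = CONTRACTION_RE.sub(lambda m: CONTRACTIONS[m.group(1)], text)
    # Remove emojis, punctuation, and any remaining special characters.
    text = EMOJI_RE.sub(" ", text)
    text = text.translate(PUNCT_TABLE)
    text = NON_ALNUM_RE.sub(" ", text)
    # Remove stop words and collapse whitespace.
    tokens = [tok for tok in text.split() if tok not in STOP_WORDS]
    return " ".join(tokens)


print(clean_text("<p>It's a GREAT day, don't miss the match! \U0001F389</p>"))
```

Contractions are expanded before punctuation is removed in this sketch, since stripping apostrophes first would leave fragments such as "don t" that no longer match the contraction map.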

License

CC0
Original Data Source: Topic_classification_dataset

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

22/06/2025

REGION

GLOBAL

UDQS QUALITY (Universal Data Quality Score)

5 / 5

VERSION

1.0
