Sentiment Analysis Dataset
About
This dataset is a large-scale collection of 241,000+ English-language comments sourced from various online platforms. Each comment is annotated with a sentiment label:
0 — Negative
1 — Neutral
2 — Positive
The data was gathered from multiple sources, including:
Hugging Face: https://huggingface.co/datasets/Sp1786/multiclass-sentiment-analysis-dataset
Kaggle: https://www.kaggle.com/datasets/abhi8923shriv/sentiment-analysis-dataset
Kaggle: https://www.kaggle.com/datasets/jp797498e/twitter-entity-sentiment-analysis
Kaggle: https://www.kaggle.com/datasets/crowdflower/twitter-airline-sentiment
The goal is to enable training and evaluation of multi-class sentiment analysis models on real-world text data. The dataset is already preprocessed (lowercased, with punctuation, URLs, numbers, and stopwords removed) and is ready for NLP pipelines.
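Because the comments are already cleaned, new text passed to a trained model should be normalized in a similar way. The listing does not publish the exact cleaning script, so the following is only a minimal sketch of the steps named above. The function name `clean_comment`, the regular expressions, and the short `STOPWORDS` set are assumptions made for illustration, not the dataset author's actual pipeline.

```python
import re
import string

# Hypothetical stopword list for illustration only; the list actually used
# to build the dataset is not documented.
STOPWORDS = {"a", "an", "the", "of", "at", "in", "on"}

URL_RE = re.compile(r"https?://\S+|www\.\S+")
DIGIT_RE = re.compile(r"\d+")
# Map every ASCII punctuation character to a space.
PUNCT_TABLE = str.maketrans(string.punctuation, " " * len(string.punctuation))


def clean_comment(text: str) -> str:
    """Lowercase, then strip URLs, digits, punctuation, and stopwords."""
    text = text.lower()
    text = URL_RE.sub(" ", text)        # remove URLs before punctuation is stripped
    text = DIGIT_RE.sub(" ", text)      # remove numbers
    text = text.translate(PUNCT_TABLE)  # remove ASCII punctuation
    tokens = [tok for tok in text.split() if tok not in STOPWORDS]
    return " ".join(tokens)


print(clean_comment("Check out the NEW phone at https://example.com, only $499!!!"))
```

Applying the same normalization at inference time keeps the input distribution closer to what the model saw during training. Non-ASCII punctuation and emoji are not handled by this sketch.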
📊 Columns
| Column | Description |
|---|---|
| Comment | User-generated text content |
| Sentiment | Sentiment label (0=Negative, 1=Neutral, 2=Positive) |
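The two columns can be read and the integer labels mapped back to their names with the standard library alone. This is a sketch under assumptions: the file is taken to be a CSV whose header row is exactly `Comment,Sentiment`, and `SAMPLE_CSV` is an in-memory stand-in for the real file. Only its first row comes from the listing; the other two rows are made up for illustration.

```python
import csv
import io
from collections import Counter

LABEL_NAMES = {0: "Negative", 1: "Neutral", 2: "Positive"}

# In-memory stand-in for the real file (assumed header: Comment,Sentiment).
# Rows two and three are invented examples, not taken from the dataset.
SAMPLE_CSV = """Comment,Sentiment
apple pay is so convenient secure and easy to use,2
flight delayed again and nobody told us,0
package arrived on tuesday,1
"""


def load_rows(handle):
    """Return a list of (comment, label) pairs, validating each label."""
    rows = []
    for record in csv.DictReader(handle):
        label = int(record["Sentiment"])
        if label not in LABEL_NAMES:
            raise ValueError(f"unexpected sentiment label: {label}")
        rows.append((record["Comment"], label))
    return rows


rows = load_rows(io.StringIO(SAMPLE_CSV))
# Class distribution, keyed by human-readable label name.
counts = Counter(LABEL_NAMES[label] for _, label in rows)
print(counts)
```

For the real file, replace `io.StringIO(SAMPLE_CSV)` with `open(path, newline="", encoding="utf-8")`, where `path` is wherever the download was saved. Checking the class distribution first is useful because imbalance affects which evaluation metric is appropriate.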
🚀 Use Cases
🧠 Train sentiment classifiers using LSTM, BiLSTM, CNN, BERT, or RoBERTa
🔍 Evaluate preprocessing and tokenization strategies
📈 Benchmark NLP models on multi-class classification tasks
🎓 Educational projects and research in opinion mining or text classification
🧪 Fine-tune transformer models on a large and diverse sentiment dataset
Original Data Source: Sentiment Analysis Dataset
💬 Example
Comment: "apple pay is so convenient secure and easy to use"
Sentiment: 2 (Positive)