Programming Comment Sentiment Analysis Dataset
Software and Technology
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset comprises YouTube comments harvested from programming-related videos, meticulously annotated for sentiment classification. Each comment is categorised as positive (1), negative (0), or indeterminate (-1), providing a valuable resource for natural language processing (NLP) and machine learning applications. The annotation was performed by the dataset creator, establishing a specific context for sentiment analysis.
Columns
- text: The raw YouTube comment itself.
- label: The sentiment classification assigned to the comment, with values of 1 (positive), 0 (negative), or -1 (indeterminate).
Distribution
This dataset contains over 500 unique annotated comments. Specifically, there are 511 unique labelled entries, consisting of 431 positive, 74 negative, and 8 indeterminate comments. While the precise file format for download is not specified, data files on the platform are typically provided in CSV format. The dataset version is 1.0.
Usage
This dataset is ideal for training and testing sentiment analysis models, text classification algorithms, and for tasks requiring binary classification. It can also be utilised for data cleaning exercises and various NLP research projects, especially those focused on user-generated content in the technology domain.
Coverage
The dataset covers YouTube comments from programming videos, offering insights into public sentiment within this specific technological niche. The comments originate from a global region. No specific time range for comment collection is provided.
License
CC0
Who Can Use It
This dataset is particularly useful for data scientists, machine learning engineers, and NLP researchers looking to build or evaluate models for sentiment analysis within a technical context. It is also suitable for students and developers exploring text mining and classification challenges.
Dataset Name Suggestions
- YouTube Programming Comment Sentiment
- Programming Comment Sentiment Analysis Dataset
- Tech Comment Sentiment
- DevTube Sentiment Data
- Code Video Comment Sentiment
Attributes
Original Data Source: 500+ Programming YTB Comments