Opendatabay APP

Programming Comment Sentiment Analysis Dataset

Software and Technology

Tags and Keywords

Computer

Science

Programming

Text

Classification

Nlp

Data

Cleaning

Binary

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Programming Comment Sentiment Analysis Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset comprises YouTube comments harvested from programming-related videos, meticulously annotated for sentiment classification. Each comment is categorised as positive (1), negative (0), or indeterminate (-1), providing a valuable resource for natural language processing (NLP) and machine learning applications. The annotation was performed by the dataset creator, establishing a specific context for sentiment analysis.

Columns

  • text: The raw YouTube comment itself.
  • label: The sentiment classification assigned to the comment, with values of 1 (positive), 0 (negative), or -1 (indeterminate).

Distribution

This dataset contains over 500 unique annotated comments. Specifically, there are 511 unique labelled entries, consisting of 431 positive, 74 negative, and 8 indeterminate comments. While the precise file format for download is not specified, data files on the platform are typically provided in CSV format. The dataset version is 1.0.

Usage

This dataset is ideal for training and testing sentiment analysis models, text classification algorithms, and for tasks requiring binary classification. It can also be utilised for data cleaning exercises and various NLP research projects, especially those focused on user-generated content in the technology domain.

Coverage

The dataset covers YouTube comments from programming videos, offering insights into public sentiment within this specific technological niche. The comments originate from a global region. No specific time range for comment collection is provided.

License

CC0

Who Can Use It

This dataset is particularly useful for data scientists, machine learning engineers, and NLP researchers looking to build or evaluate models for sentiment analysis within a technical context. It is also suitable for students and developers exploring text mining and classification challenges.

Dataset Name Suggestions

  • YouTube Programming Comment Sentiment
  • Programming Comment Sentiment Analysis Dataset
  • Tech Comment Sentiment
  • DevTube Sentiment Data
  • Code Video Comment Sentiment

Attributes

Original Data Source: 500+ Programming YTB Comments

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

27/06/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in CSV Format