Film Review Binary Sentiment Dataset
Entertainment & Media Consumption
Free
About
This dataset provides 10,000 text reviews of films, each labelled with a binary sentiment: positive or negative [1]. It is a reduced version of the IMDB text review data, suited to machine learning projects in sentiment analysis and natural language processing [1, 2].
Columns
- review: This column contains the full text of the movie review itself [1].
- sentiment: This column indicates the sentiment classification for the corresponding review. A value of 0 represents a positive sentiment, while 1 signifies a negative sentiment [1].
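As a minimal sketch of reading the two columns described above, the following uses only the Python standard library. The inline sample rows and the names `load_reviews`, `LABEL_NAMES`, and `SAMPLE_CSV` are illustrative assumptions, not part of the dataset; the label mapping follows the convention stated in this listing (0 = positive, 1 = negative) and is worth verifying against the actual file.

```python
import csv
import io

# Label convention as stated in this listing: 0 = positive, 1 = negative.
LABEL_NAMES = {0: "positive", 1: "negative"}


def load_reviews(csv_file):
    """Read (review, sentiment) pairs from an open CSV file object."""
    rows = []
    for record in csv.DictReader(csv_file):
        rows.append((record["review"], int(record["sentiment"])))
    return rows


# Hypothetical two-row sample standing in for the real CSV file.
SAMPLE_CSV = (
    "review,sentiment\n"
    '"A moving, beautifully shot film.",0\n'
    '"Dull plot, wooden acting.",1\n'
)

sample_rows = load_reviews(io.StringIO(SAMPLE_CSV))
for text, label in sample_rows:
    print(LABEL_NAMES[label], "|", text)
```

To read the real file, pass an object opened with `open(path, newline="", encoding="utf-8")` in place of the `io.StringIO` sample; the path itself is not given in this listing.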
Distribution
The dataset comprises 10,000 unique movie reviews [1] and is typically distributed as a CSV file [3]. Of these, 5,037 (50.37%) are labelled positive (0) and 4,963 (49.63%) are labelled negative (1), so the two classes are nearly balanced for training [1]. The dataset is available globally [2].
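The class shares implied by these counts can be checked with a few lines of Python. This is a sketch built from the counts reported in this listing rather than from the file itself; `class_shares` is a hypothetical helper name.

```python
from collections import Counter


def class_shares(labels):
    """Return the fraction of examples carrying each label."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {label: count / total for label, count in counts.items()}


# Counts as reported in the listing: 5,037 positive (0), 4,963 negative (1).
labels = [0] * 5037 + [1] * 4963
shares = class_shares(labels)
majority_share = max(shares.values())

print(shares)
print(majority_share)
```

The larger of the two shares is also the accuracy of a classifier that always predicts the majority class, which gives a floor for judging any trained model.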
Usage
This dataset is particularly well-suited for a variety of applications, including:
- Training and evaluating sentiment analysis models [1].
- Developing and testing algorithms for binary text classification [4].
- Enhancing Natural Language Processing (NLP) capabilities [4].
- Experimenting with Transformers and PyTorch models for text understanding [4].
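Before moving to Transformers or PyTorch, a simple baseline is often useful for comparison. The sketch below is a bag-of-words multinomial Naive Bayes classifier with add-one smoothing, written in plain Python. It is a swapped-in illustrative technique, not a method prescribed by the dataset, and the training sentences, `tokenize`, and `NaiveBayes` are hypothetical. Labels follow the listing's convention (0 = positive, 1 = negative).

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z']+", text.lower())


class NaiveBayes:
    """Multinomial Naive Bayes over word counts with add-one smoothing."""

    def fit(self, texts, labels):
        self.word_counts = defaultdict(Counter)
        self.doc_counts = Counter(labels)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict(self, text):
        total_docs = sum(self.doc_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.doc_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            # Log prior plus smoothed log likelihood of each known token.
            score = math.log(n_docs / total_docs)
            for token in tokenize(text):
                if token in self.vocab:
                    score += math.log((counts[token] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy training data; the real reviews would replace these.
train_texts = [
    "a wonderful moving film",
    "great acting and a great story",
    "boring and dull plot",
    "terrible acting awful script",
]
train_labels = [0, 0, 1, 1]

model = NaiveBayes().fit(train_texts, train_labels)
print(model.predict("a great film"))
print(model.predict("dull and boring script"))
```

On the full dataset, a held-out test split would be needed to measure accuracy; the toy data here only shows the mechanics.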
Coverage
The dataset covers general movie reviews [1, 2]. No geographic or demographic details about the original reviewers are provided, although the dataset is listed with global region coverage [2]. The dataset was listed on 17/06/2025 [2].
License
CC0
Who Can Use It
This dataset is valuable for:
- Data scientists and machine learning engineers who need labelled text data to build and improve sentiment prediction models [1].
- Researchers in the fields of NLP and artificial intelligence exploring text classification techniques [4].
- Students and developers learning about text data processing and sentiment analysis [5].
Dataset Name Suggestions
- IMDB Movie Review Sentiment (10K)
- Film Review Binary Sentiment Dataset
- Movie Sentiment Classification Dataset
- Textual Movie Review Sentiment Analysis
Attributes
Original Data Source: imdb_sentiment_10k_reviews_binary_classification