Filmaffinity Movie Reviews Dataset
Entertainment & Media Consumption
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset provides a valuable collection of user reviews for Spanish films and series, sourced from www.filmaffinity.com. It was created to enhance the resources available for Natural Language Processing (NLP) in Spanish, an area where research has historically focused on the English language. The dataset is ideal for understanding natural language in Spanish and can be applied to analyses related to public satisfaction, film rating trends, and the critical reception of audiovisual productions. It supports various text analysis methods and statistical studies on content popularity and acceptance.
Columns
- film_name: The title of the film or series that has been reviewed.
- gender: Indicates the genre or genres of the film or series. Records may contain multiple genres, typically separated by commas.
- film_avg_rate: The average rating received by the film or series, generally on a scale of 1 to 10, based on votes from all users.
- review_rate: The specific rating given by the user who wrote the individual review, also on a scale of 1 to 10.
- review_title: The title provided by the user for their review.
- review_text: The full body of the review, detailing the user's opinion on the film or series.
Distribution
This dataset contains reviews for over 1000 Spanish films and series, with more than 10,000 reviews in total. It is structured as a corpus of user-generated text and associated metadata. The data is suitable for import into various analytical platforms and is typically provided in a structured format such as CSV.
Usage
This dataset is particularly useful for:
- Natural Language Processing (NLP) research in Spanish: Developing and testing algorithms for sentiment analysis, text classification, and entity recognition.
- Text analysis: Performing sentiment analysis, generating word clouds, and identifying key themes within user reviews.
- Statistical studies: Analysing patterns in film ratings, exploring correlations between average film ratings and individual review scores, and studying audience reception.
- Content insights: Gaining insights into public satisfaction, tracking rating trends, and understanding critical reception of Spanish audiovisual content.
Coverage
The dataset focuses exclusively on reviews of Spanish movies and series. All reviews are in the Spanish language, gathered from users of www.filmaffinity.com. The data covers a wide array of genres and includes reviews for over 1000 unique titles. The geographic scope for review origin is global, although the content reviewed is Spanish.
License
CCO
Who Can Use It
- NLP Researchers: To build and refine models for Spanish language understanding.
- Data Scientists: For conducting text analytics, sentiment analysis, and predictive modelling based on user reviews.
- Machine Learning Practitioners: To train and validate algorithms for recommendation systems or content popularity prediction.
- Academics and Students: For studies in linguistics, media studies, and artificial intelligence, particularly focusing on the Spanish language.
- Kaggle Users (Spanish-speaking): To share knowledge and collaborate on NLP projects in Spanish.
Dataset Name Suggestions
- Spanish Film and Series Review Corpus
- Filmaffinity Spanish Movie Reviews
- Spanish NLP Film Critiques
- Audiovisual Content Reviews (Spanish)
- Spanish User Film Ratings and Reviews
Attributes
Original Data Source: Críticas Filmaffinity Netflix Español (+10000)