Filmaffinity Movie Reviews Dataset

Entertainment & Media Consumption

Tags and Keywords

Earth

Arts

Movies

Data

NLP

Websites

Free

About

This dataset is a collection of user reviews of Spanish films and series, sourced from www.filmaffinity.com. It was created to expand the resources available for Natural Language Processing (NLP) in Spanish, since NLP research has historically focused on English. The dataset is suited to Spanish natural language understanding tasks and to analyses of public satisfaction, film rating trends, and the critical reception of audiovisual productions. It supports text analysis methods as well as statistical studies of content popularity and acceptance.

Columns

  • film_name: The title of the film or series that has been reviewed.
  • gender: The genre or genres of the film or series (the column name appears to be a literal translation of the Spanish "género"). Records may contain multiple genres, typically separated by commas.
  • film_avg_rate: The average rating received by the film or series, generally on a scale of 1 to 10, based on votes from all users.
  • review_rate: The specific rating given by the user who wrote the individual review, also on a scale of 1 to 10.
  • review_title: The title provided by the user for their review.
  • review_text: The full body of the review, detailing the user's opinion on the film or series.
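
As a minimal sketch of how the columns above might be read, the following uses Python's standard csv module. The sample rows are invented for illustration and are not taken from the dataset, the helper name parse_reviews is hypothetical, and the sketch assumes ratings are stored with a decimal point.

```python
import csv
import io

# Invented sample rows that only mimic the documented column layout;
# they are NOT taken from the dataset.
SAMPLE_CSV = """film_name,gender,film_avg_rate,review_rate,review_title,review_text
Película A,"Drama, Thriller",6.8,7,Muy buena,Una historia bien contada.
Película B,Comedia,5.1,3,Floja,No me hizo reír.
"""


def parse_reviews(handle):
    """Read review rows and convert the documented columns to useful types."""
    rows = []
    for raw in csv.DictReader(handle):
        rows.append({
            "film_name": raw["film_name"],
            # The "gender" column holds genres; split the comma-separated list.
            "genres": [g.strip() for g in raw["gender"].split(",") if g.strip()],
            # Assumption: ratings use a decimal point, not a decimal comma.
            "film_avg_rate": float(raw["film_avg_rate"]),
            "review_rate": float(raw["review_rate"]),
            "review_title": raw["review_title"],
            "review_text": raw["review_text"],
        })
    return rows


reviews = parse_reviews(io.StringIO(SAMPLE_CSV))
print(reviews[0]["film_name"], reviews[0]["genres"])
```

To read a real file, an open file handle (opened with encoding="utf-8" and newline="") could be passed in place of the io.StringIO object.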

Distribution

This dataset contains more than 10,000 reviews covering over 1,000 Spanish films and series. It is structured as a corpus of user-generated text with associated metadata, and is typically provided as a CSV file that can be imported into common analytical tools.
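
The counts above could be checked after download with a short tally such as the one below. This is only a sketch: the function name summarize_titles is hypothetical, and the title list is invented rather than drawn from the dataset.

```python
from collections import Counter


def summarize_titles(film_names):
    """Count total reviews, unique titles, and the most-reviewed title."""
    per_title = Counter(film_names)
    return {
        "reviews": sum(per_title.values()),
        "titles": len(per_title),
        # most_common(1) returns a list holding one (title, count) pair.
        "most_reviewed": per_title.most_common(1)[0] if per_title else None,
    }


# Invented film_name values, one entry per review row.
names = ["Película A", "Película B", "Película A", "Película C", "Película A"]
summary = summarize_titles(names)
print(summary)
```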

Usage

This dataset is particularly useful for:
  • Natural Language Processing (NLP) research in Spanish: Developing and testing algorithms for sentiment analysis, text classification, and entity recognition.
  • Text analysis: Performing sentiment analysis, generating word clouds, and identifying key themes within user reviews.
  • Statistical studies: Analysing patterns in film ratings, exploring correlations between average film ratings and individual review scores, and studying audience reception.
  • Content insights: Gaining insights into public satisfaction, tracking rating trends, and understanding critical reception of Spanish audiovisual content.
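
Two of the uses listed above can be sketched with the standard library alone: deriving coarse sentiment labels from review_rate, and measuring the correlation between film_avg_rate and review_rate. The cut-off values and the rating pairs below are assumptions made for illustration; they are not defined by the dataset.

```python
from math import sqrt


def sentiment_label(review_rate, negative_max=4, positive_min=7):
    """Map a 1-10 rating to a coarse label; the cut-offs are assumptions."""
    if review_rate <= negative_max:
        return "negative"
    if review_rate >= positive_min:
        return "positive"
    return "neutral"


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


# Invented (film_avg_rate, review_rate) pairs, not taken from the dataset.
pairs = [(6.8, 7), (5.1, 3), (7.4, 9), (4.2, 5)]
labels = [sentiment_label(rate) for _, rate in pairs]
r = pearson([avg for avg, _ in pairs], [rate for _, rate in pairs])
print(labels, round(r, 3))
```

Labels derived this way are only a proxy for sentiment, since a rating and the tone of its review text do not always agree.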

Coverage

The dataset focuses exclusively on reviews of Spanish movies and series. All reviews are written in Spanish and were gathered from users of www.filmaffinity.com. The data covers a wide range of genres and includes reviews for over 1,000 unique titles. Reviewers may be located anywhere in the world, but all reviewed content is Spanish.

License

CC0

Who Can Use It

  • NLP Researchers: To build and refine models for Spanish language understanding.
  • Data Scientists: For conducting text analytics, sentiment analysis, and predictive modelling based on user reviews.
  • Machine Learning Practitioners: To train and validate algorithms for recommendation systems or content popularity prediction.
  • Academics and Students: For studies in linguistics, media studies, and artificial intelligence, particularly focusing on the Spanish language.
  • Kaggle Users (Spanish-speaking): To share knowledge and collaborate on NLP projects in Spanish.

Dataset Name Suggestions

  • Spanish Film and Series Review Corpus
  • Filmaffinity Spanish Movie Reviews
  • Spanish NLP Film Critiques
  • Audiovisual Content Reviews (Spanish)
  • Spanish User Film Ratings and Reviews

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

08/06/2025

REGION

GLOBAL

UDQS QUALITY

5 / 5

VERSION

1.0