Opendatabay APP

Global Movie Popularity and Rating Data

Product Reviews & Feedback

Tags and Keywords

Movies

Ratings

Genres

Popularity

Cinema

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Global Movie Popularity and Rating Data Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

Dive into the fascinating realm of cinema with a rich collection of detailed information concerning 10,000 films. This dataset serves as a valuable resource for data analysis, offering insights into cinematic trends, audience reception, and the talented individuals who create these works. The collection includes titles spanning both classic and contemporary eras, sourced primarily from The Movie Database (TMDB) API.

Columns

  • adult: Boolean indicator if the film is classified for adult audiences.
  • backdrop_path: URL link to the movie's cover poster image.
  • genre_ids: Identifiers corresponding to the genres associated with the film.
  • id: Unique identification number assigned to the movie.
  • original_language: The primary language in which the film was produced, such as 'en' or 'fr'.
  • original_title: The title of the film as originally released.
  • overview: A brief synopsis or summary providing insight into the movie's plot.
  • popularity: A numerical score reflecting the current popularity of the film.
  • poster_path: URL link to the main movie poster image.
  • release_date: The specific date the film was first released.
  • title: The commonly known or translated title of the movie.
  • vote_average: The mean score derived from audience votes.
  • vote_count: The total number of votes the film has received.
  • keywords: Specific thematic terms associated with the film.
  • cast: Details regarding the principal cast members.
  • crew: Information concerning the production crew.

Distribution

The data is provided in a clean CSV file format, labelled CleanedTMDB1000.csv. It comprises 10,000 individual movie records across 18 distinct attributes and has a total file size of 281.36 MB. The structure is well-maintained, exhibiting a high degree of data validity across all core fields.

Usage

  • Building personalised movie recommendation engines based on genre and audience preference patterns.
  • Developing machine learning models to estimate a film's future popularity based on its attributes.
  • Conducting detailed data analysis on genre performance, audience voting behaviour, and release trends over time.
  • Creating visualisations that showcase insights into movie distribution and critical reception.

Coverage

The dataset covers movie releases spanning over a century, beginning with titles released as early as June 1895 and extending through June 2023. While the collection is globally sourced and includes 46 unique original languages, the dominant language represented is English, accounting for 76% of the records. The scope includes 10,000 diverse cinematic works.

License

Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)

Who Can Use It

  • Data Analysts: For investigating relationships between cinematic attributes, audience reception, and temporal trends.
  • Machine Learning Engineers: For training and validating predictive models related to film success metrics.
  • Film Critics/Academics: For studying the evolution of genres and the influence of cast and crew across various cinematic eras.
  • Developers: For building applications that require rich, structured metadata on historical and current film titles.

Dataset Name Suggestions

  • TMDB 10K Movie Attributes
  • Cinematic Trends and Insights Dataset
  • Global Movie Popularity and Rating Data
  • 10,000 Film Data Collection.

Attributes

Listing Stats

VIEWS

8

DOWNLOADS

1

LISTED

04/10/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in CSV Format