Popular TMDB Movies Dataset
Data Science and Analytics
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset offers an insight into over 9,500 popular movies from The Movie Database (TMDB), a prominent source for film and TV information [1, 2]. It includes essential details such as titles, release dates, genres, original languages, vote averages, vote counts, popularity scores, overviews, budgets, and runtimes [1]. The dataset is a valuable resource for anyone keen to explore trends, patterns, and audience preferences within the global entertainment industry [1]. It is well-suited for enhancing movie recommendation systems, performing sentiment analysis, or generating creative content [1]. The collection provides a rich and diverse view of movies across various genres, languages, and countries [1].
Columns
The dataset includes the following columns:
- id: A unique identifier for each movie [2].
- title: The name of the movie [2].
- release_date: The date the movie was released [2].
- genres: Categories or types of the movie [2].
- original_language: The original language in which the movie was produced [2].
- vote_average: The average rating given to the movie [2].
- vote_count: The total number of votes received by the movie [2].
- popularity: A score indicating the movie's popularity [2].
- overview: A brief summary or synopsis of the movie [2].
- budget: The financial budget allocated for the movie's production [2].
Distribution
This dataset contains 9,616 unique movie titles [2]. The rows within the dataset are not duplicated [2].
Usage
This dataset is ideal for:
- Exploring movie and TV audience trends and preferences globally [1].
- Developing and enhancing movie recommendation systems [1].
- Conducting sentiment analysis on movie overviews or related content [1].
- Generating creative content based on movie data [1].
- Data science and analytics projects [1].
- Visualisation exercises [1].
Coverage
The dataset provides a global perspective on the entertainment industry, covering a wide array of genres, languages, and countries [1, 3].
- Genres: It features 5860 unique genre values, with Drama accounting for approximately 15% and Action for 11% [4].
- Languages: English is the most common original language at 74%, followed by Japanese at 10% [4].
- Production Companies: There are 9883 unique production company values [5].
License
CC-BY-NC
Who Can Use It
The dataset is suitable for:
- Data scientists and analysts interested in film industry trends [1].
- Developers building recommender systems for movies [1].
- Researchers conducting sentiment analysis on movie data [1].
- Beginners learning data analysis and visualisation techniques [1].
- Anyone interested in consumer and product data related to entertainment [1].
Dataset Name Suggestions
- Popular TMDB Movies Dataset
- Global Movie Insights from TMDB
- TMDB Film Data Collection
- Movie Audience Trends Data
Attributes
Original Data Source: 9500+ Popular Movies TMDB