
Film & Genre Insights Dataset

Product Reviews & Feedback

Tags and Keywords

Movies, Genres, IMDb, Film, Ratings

Free

About

This dataset features over 5,000 movies across a range of genres and languages, with detailed information about each. The films are top-rated titles gathered from the IMDb website, released from the 1920s to 2022, and the data has been collected, cleaned, and arranged.

Columns

  • Movie_Title: Contains over 5,000 unique movie titles.
  • Year: Indicates the release year of the movie, ranging from the 1920s to 2022.
  • Director: Specifies the name of the director, with over 2,000 unique values.
  • Actors: Lists the names of the actors. A single cell can hold multiple names, and the column has over 5,000 unique values.
  • Rating: Represents the IMDb rating out of 10, derived from over 25,000 voters.
  • main_genre: Denotes the primary genre of the film, with more than 13 distinct genres.
  • side_genre: Provides the secondary genre or genres associated with the movie, with more than 144 unique combinations.
  • Runtime(Mins): Shows the total duration of the movie in minutes.
  • Censor: Indicates the censorship certificate or rating applied to the movie.
  • Total_Gross: States the total box-office collection achieved by the movie.
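
Because Actors and side_genre can hold several values in one cell, it is often useful to split them into lists before analysis. The following is a minimal sketch, assuming the values are comma-separated (the listing does not state the delimiter); `split_multi` is a hypothetical helper name and the sample cell is a placeholder, not dataset content.

```python
def split_multi(cell, sep=","):
    """Split a multi-valued cell into a list of trimmed, non-empty values.

    Assumption: values are separated by `sep` (comma by default); the
    listing does not document the actual delimiter.
    """
    return [part.strip() for part in cell.split(sep) if part.strip()]


# Placeholder cell contents, for illustration only.
print(split_multi("Genre A,  Genre B"))
```

If the real file uses a different separator, pass it through the `sep` argument.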

Distribution

The dataset is available in a CSV file format, specifically named 'IMDb_All_Genres_etf_clean1.csv', with a file size of 797.22 kB. It comprises 10 distinct columns and a total of 5,562 individual records.
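
The file description above can be checked on load. The sketch below uses only Python's standard `csv` module and verifies that the ten column names listed under Columns are present. The function name `read_movies` and the sample row are hypothetical and exist only so the example runs without the real file.

```python
import csv
import io

# Column names as given in the listing's Columns section.
EXPECTED_COLUMNS = [
    "Movie_Title", "Year", "Director", "Actors", "Rating",
    "main_genre", "side_genre", "Runtime(Mins)", "Censor", "Total_Gross",
]


def read_movies(handle):
    """Read rows from an open CSV handle and verify the header.

    Raises ValueError if any expected column is missing.
    """
    reader = csv.DictReader(handle)
    missing = [c for c in EXPECTED_COLUMNS if c not in (reader.fieldnames or [])]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    return list(reader)


# Placeholder row (not dataset content) so the sketch runs without the file.
sample = io.StringIO(
    ",".join(EXPECTED_COLUMNS) + "\n"
    "Example Film,2000,Example Director,\"Actor One, Actor Two\",7.5,Drama,"
    "\"Crime, Thriller\",120,PG,Unknown\n"
)
rows = read_movies(sample)
print(len(rows), rows[0]["Movie_Title"])
```

With the downloaded file, the same function could be called on `open("IMDb_All_Genres_etf_clean1.csv", newline="", encoding="utf-8")`; the UTF-8 encoding is an assumption. The resulting row count can then be compared against the 5,562 records stated in the listing.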

Usage

This dataset is well-suited for a variety of applications, such as developing movie recommendation systems, conducting detailed film industry analysis, identifying emerging genre trends, assessing director and actor performance, and exploring patterns in film censorship and box-office revenue. It is also valuable for educational purposes in data science and film studies.
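
As one illustration of the genre analysis mentioned above, the sketch below averages the Rating column per main_genre using only the standard library. It assumes rows are dictionaries keyed by the column names from the listing, with Rating stored as text that parses as a number. `mean_rating_by_genre` is a hypothetical helper and the demo rows are placeholders, not dataset content.

```python
from collections import defaultdict


def mean_rating_by_genre(rows):
    """Return {main_genre: mean Rating} for rows whose Rating parses as a float."""
    totals = defaultdict(lambda: [0.0, 0])  # genre -> [sum of ratings, count]
    for row in rows:
        try:
            rating = float(row["Rating"])
        except (KeyError, TypeError, ValueError):
            continue  # skip rows with a missing or non-numeric rating
        bucket = totals[row.get("main_genre", "")]
        bucket[0] += rating
        bucket[1] += 1
    return {genre: total / count for genre, (total, count) in totals.items()}


# Placeholder rows, not taken from the dataset.
demo = [
    {"main_genre": "Drama", "Rating": "8.0"},
    {"main_genre": "Drama", "Rating": "7.0"},
    {"main_genre": "Comedy", "Rating": "6.5"},
]
print(mean_rating_by_genre(demo))
```

The same pattern extends to grouping by Director or Year by changing the key used for the bucket.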

Coverage

The dataset covers films released within the timeframe of 1920 to 2022. It includes movies from various genres and languages, providing a broad representation of cinematic history as documented on IMDb. There are no specific geographic limitations mentioned, suggesting a global scope of film representation.

License

CC0: Public Domain

Who Can Use It

This dataset is intended for data scientists, machine learning engineers, film researchers, media analysts, students, and anyone with an interest in cinematic data and trends. It is particularly useful for those aiming to build predictive models for film success, perform genre classification, or create content recommendation engines.

Dataset Name Suggestions

  • IMDb Global Movie Data
  • Film & Genre Insights Dataset
  • CineData IMDb Collection
  • Historical Movie Metrics
  • Multi-Decade Movie Database

Attributes

Original Data Source: Film & Genre Insights Dataset

Listing Stats

Views: 11
Downloads: 4
Listed: 30/08/2025
Region: Global
Universal Data Quality Score (UDQS): 5 / 5
Version: 1.0
