Film & Genre Insights Dataset
Product Reviews & Feedback
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset features over 5,000 movies across various genres, offering detailed information about each. It comprises top-rated films gathered from the IMDb website, including titles of different genres and languages. The collection spans movies from the early 1930s up to the current year, all meticulously collected, cleaned, and arranged.
Columns
- Movie_Title: Contains over 5,000 unique movie titles.
- Year: Indicates the release year of the movie, ranging from the 1920s to 2022.
- Director: Specifies the name of the director, with over 2,000 unique values.
- Actors: Lists the names of the actors, featuring over 5,000 unique and multiple values.
- Rating: Represents the IMDb rating out of 10, derived from over 25,000 voters.
- main_genre: Denotes the primary genre of the film, including over 13 unique genre types.
- side_genre: Provides secondary or multiple genres associated with the movie, with over 144 unique combinations.
- Runtime(Mins): Shows the total duration of the movie in minutes.
- Censor: Indicates the censorship certificate or rating applied to the movie.
- Total_Gross: States the total box-office collection achieved by the movie.
Distribution
The dataset is available in a CSV file format, specifically named 'IMDb_All_Genres_etf_clean1.csv', with a file size of 797.22 kB. It comprises 10 distinct columns and a total of 5,562 individual records.
Usage
This dataset is well-suited for a variety of applications, such as developing movie recommendation systems, conducting detailed film industry analysis, identifying emerging genre trends, assessing director and actor performance, and exploring patterns in film censorship and box-office revenue. It is also valuable for educational purposes in data science and film studies.
Coverage
The dataset covers films released within the timeframe of 1920 to 2022. It includes movies from various genres and languages, providing a broad representation of cinematic history as documented on IMDb. There are no specific geographic limitations mentioned, suggesting a global scope of film representation.
License
CC0: Public Domain
Who Can Use It
This dataset is intended for data scientists, machine learning engineers, film researchers, media analysts, students, and anyone with an interest in cinematic data and trends. It is particularly useful for those aiming to build predictive models for film success, perform genre classification, or create content recommendation engines.
Dataset Name Suggestions
- IMDb Global Movie Data
- Film & Genre Insights Dataset
- CineData IMDb Collection
- Historical Movie Metrics
- Multi-Decade Movie Database
Attributes
Original Data Source: Film & Genre Insights Dataset