Netflix IMDB Dataset
Entertainment & Media Consumption
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset provides a detailed list and metadata for approximately 7,000 TV shows and movies available on Netflix as of June 2021. Sourced from the IMDB website, it offers insights into content characteristics, popularity, and categorisation, making it suitable for various analytical and machine learning applications.
Columns
- imdb_id: A unique identifier for each show or movie.
- title: The title of the television programme or film.
- popular_rank: The ranking assigned by IMDB based on popularity.
- certificate: Age certifications received by the content; it is noted that many values may be null.
- startYear: The year the show was first broadcast or the film was released.
- endYear: The year a show concluded, if applicable.
- episodes: The total number of episodes in a series; for films, this value is 1.
- runtime: The running time of the content.
- type: Specifies whether the content is a 'Movie' or 'Series'.
- orign_country: The country of origin for the show or movie.
- language: The primary language of the content.
- plot: A synopsis of the show or movie.
- summary: A concise summary of the story.
- rating: The average user rating for the content.
- numVotes: The total number of votes received for the content's rating.
- genres: The genre(s) to which the show or movie belongs.
- isAdult: A binary indicator (1 for adult content, 0 otherwise).
- cast: The main cast members listed in a suitable format.
- image_url: A link to the poster image for the content.
Distribution
The dataset is typically provided as a CSV file, specifically named
netflix_list.csv
. It contains approximately 7,000 records, with 7,008 unique identifiers for shows and movies. This dataset is listed as version 1.0 and was added to the platform on 11 June 2025.Usage
This dataset is ideally suited for developing recommender systems, performing natural language processing (NLP) tasks on plot summaries, and conducting market analysis of entertainment content. It can be used to explore trends in movie and TV show production, analyse viewer preferences, and facilitate content categorisation efforts.
Coverage
The dataset offers global coverage, with information on content originating from various countries. The
startYear
of content spans from 1932 to 2022, with the majority of content released between 2004 and 2022. The endYear
ranges from 1969 to 2022, with most data concentrated from 2011 to 2022. It includes age certification information and an indicator for adult content, allowing for demographic considerations related to content suitability.License
CCO
Who Can Use It
This dataset is valuable for data scientists and machine learning engineers working on content recommendation engines or text analysis projects. It is also beneficial for researchers studying media consumption patterns and entertainment industry analysts interested in exploring the Netflix content catalogue programmatically.
Dataset Name Suggestions
- Netflix Content Metadata (June 2021)
- Global Netflix Catalogue
- Netflix IMDB Dataset
- Streaming Content Insights (Netflix)
- Netflix Movie and TV Show Archive
Attributes
Original Data Source:Netflix Movie and TV Shows (June 2021)