Netflix IMDB Dataset

Entertainment & Media Consumption

Tags and Keywords

Arts

Movies

NLP

Recommender

Free

About

This dataset provides a detailed list and metadata for approximately 7,000 TV shows and movies available on Netflix as of June 2021. Sourced from the IMDB website, it offers insights into content characteristics, popularity, and categorisation, making it suitable for various analytical and machine learning applications.

Columns

  • imdb_id: A unique identifier for each show or movie.
  • title: The title of the television programme or film.
  • popular_rank: The ranking assigned by IMDB based on popularity.
  • certificate: The age certification assigned to the content; many values may be null.
  • startYear: The year the show was first broadcast or the film was released.
  • endYear: The year a show concluded, if applicable.
  • episodes: The total number of episodes in a series; for films, this value is 1.
  • runtime: The running time of the content.
  • type: Specifies whether the content is a 'Movie' or 'Series'.
  • orign_country: The country of origin for the show or movie.
  • language: The primary language of the content.
  • plot: A synopsis of the show or movie.
  • summary: A concise summary of the story.
  • rating: The average user rating for the content.
  • numVotes: The total number of votes received for the content's rating.
  • genres: The genre(s) to which the show or movie belongs.
  • isAdult: A binary indicator (1 for adult content, 0 otherwise).
  • cast: The main cast members.
  • image_url: A link to the poster image for the content.
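As a rough illustration of working with these columns, the sketch below reads rows with Python's standard csv module and reports which of the listed columns are absent from a file. The column names come from the list above; the treatment of empty cells as missing values and the tiny in-memory sample are assumptions made for the example, not properties confirmed by the listing.

```python
import csv
import io

# Column names as given in the listing ("orign_country" is spelled that way there).
EXPECTED_COLUMNS = [
    "imdb_id", "title", "popular_rank", "certificate", "startYear", "endYear",
    "episodes", "runtime", "type", "orign_country", "language", "plot",
    "summary", "rating", "numVotes", "genres", "isAdult", "cast", "image_url",
]


def read_rows(handle):
    """Read CSV rows as dicts and report which expected columns are absent."""
    reader = csv.DictReader(handle)
    header = reader.fieldnames or []
    missing = [name for name in EXPECTED_COLUMNS if name not in header]
    rows = [
        # Assumption: an empty cell means "no value", so it is mapped to None.
        {key: (value if value not in ("", None) else None) for key, value in row.items()}
        for row in reader
    ]
    return rows, missing


# Tiny in-memory sample with made-up values, standing in for the real file.
sample = io.StringIO(
    "imdb_id,title,certificate,type\n"
    "tt0000001,Example Show,,Series\n"
)
rows, missing = read_rows(sample)
```

With the real file, the same function could be called on a handle opened with `open("netflix_list.csv", newline="", encoding="utf-8")`; the UTF-8 encoding is an assumption.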

Distribution

The dataset is provided as a single CSV file named netflix_list.csv. It contains approximately 7,000 records, with 7,008 unique imdb_id values. The listing is version 1.0 and was added to the platform on 11 June 2025.
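A quick way to check the record and identifier counts quoted above is to compare the number of rows with the number of distinct imdb_id values. The sketch below assumes rows have already been loaded as dictionaries; the function name and the sample values are hypothetical.

```python
def summarize_ids(rows, id_column="imdb_id"):
    """Count records, distinct identifiers, and repeated identifiers."""
    ids = [row[id_column] for row in rows]
    unique = set(ids)
    return {
        "records": len(ids),
        "unique_ids": len(unique),
        "duplicates": len(ids) - len(unique),
    }


# Made-up rows for illustration; one identifier appears twice.
sample_rows = [
    {"imdb_id": "tt0000001"},
    {"imdb_id": "tt0000002"},
    {"imdb_id": "tt0000001"},
]
id_summary = summarize_ids(sample_rows)
```

A non-zero duplicates count on the real file would suggest deduplicating on imdb_id before analysis.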

Usage

This dataset is ideally suited for developing recommender systems, performing natural language processing (NLP) tasks on plot summaries, and conducting market analysis of entertainment content. It can be used to explore trends in movie and TV show production, analyse viewer preferences, and facilitate content categorisation efforts.
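To make the recommender use case concrete, here is a minimal content-based sketch that ranks titles by Jaccard similarity of their genre sets. It is one simple baseline among many, not a method prescribed by the listing. It assumes genres are stored as a comma-separated string, which the listing does not confirm, and the catalogue entries are invented for the example.

```python
def genre_set(row):
    """Split the genres cell into a set of lowercase genre names."""
    # Assumption: genres are a comma-separated string such as "Crime,Drama".
    raw = row.get("genres") or ""
    return {part.strip().lower() for part in raw.split(",") if part.strip()}


def jaccard(a, b):
    """Jaccard similarity: size of the intersection over size of the union."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def recommend(rows, title, top_n=3):
    """Return up to top_n other titles sharing at least one genre with `title`."""
    target = next(row for row in rows if row["title"] == title)
    target_genres = genre_set(target)
    scored = [
        (jaccard(target_genres, genre_set(row)), row["title"])
        for row in rows
        if row is not target
    ]
    # Highest similarity first; ties are broken alphabetically by title.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for score, name in scored[:top_n] if score > 0]


# Invented catalogue entries for illustration only.
catalog = [
    {"title": "Show A", "genres": "Crime,Drama"},
    {"title": "Show B", "genres": "Crime,Drama,Thriller"},
    {"title": "Show C", "genres": "Comedy"},
    {"title": "Show D", "genres": "Drama"},
]
suggestions = recommend(catalog, "Show A")
```

The same structure could be extended with text features from the plot or summary columns, for example by comparing token sets instead of genre sets.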

Coverage

The dataset offers global coverage, with content originating from various countries. startYear values span 1932 to 2022, with the majority of content released between 2004 and 2022. endYear values range from 1969 to 2022, with most concentrated between 2011 and 2022. The dataset also includes age certification information and an adult-content indicator, which supports filtering by audience suitability.
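The year ranges described above can be checked with a small summary over the startYear column. The sketch below is a hedged example: it assumes year cells are either empty or numeric strings (possibly written like "2021.0"), and the sample rows are made up.

```python
def parse_year(value):
    """Return the year as an int, or None when the cell is empty or not numeric."""
    try:
        return int(float(value))
    except (TypeError, ValueError, OverflowError):
        return None


def year_coverage(rows, column="startYear", lo=2004, hi=2022):
    """Report the earliest year, latest year, and share of rows within [lo, hi]."""
    years = [
        year
        for year in (parse_year(row.get(column)) for row in rows)
        if year is not None
    ]
    if not years:
        return None
    in_range = sum(lo <= year <= hi for year in years)
    return {
        "min": min(years),
        "max": max(years),
        "share_in_range": in_range / len(years),
    }


# Made-up rows for illustration; the empty cell is skipped.
year_rows = [
    {"startYear": "1932"},
    {"startYear": "2010"},
    {"startYear": "2021.0"},
    {"startYear": ""},
]
coverage = year_coverage(year_rows)
```

Passing `column="endYear"` with `lo=2011` would give the corresponding summary for series end dates.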

License

CC0

Who Can Use It

This dataset is valuable for data scientists and machine learning engineers working on content recommendation engines or text analysis projects. It is also beneficial for researchers studying media consumption patterns and entertainment industry analysts interested in exploring the Netflix content catalogue programmatically.

Dataset Name Suggestions

  • Netflix Content Metadata (June 2021)
  • Global Netflix Catalogue
  • Netflix IMDB Dataset
  • Streaming Content Insights (Netflix)
  • Netflix Movie and TV Show Archive

Listing Stats

  • Views: 2
  • Downloads: 0
  • Listed: 11/06/2025
  • Region: Global
  • Universal Data Quality Score (UDQS): 5 / 5
  • Version: 1.0