Opendatabay APP

Global TV Programme Metadata

Product Reviews & Feedback

Tags and Keywords

Imdb

Television

Genre

Ratings

Analysis

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Global TV Programme Metadata Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

Information about television programmes from IMDb, including details such as title, unique IMDb identifier, release year, genre, main cast members, and a brief synopsis. Additionally, the data covers average IMDb ratings (scaled from 1 to 10), episode or series runtime, content certificates, the total number of votes received, and, where available, gross revenue. This data is gathered using web scraping techniques from the IMDb website and is organised into separate CSV files for each genre, making it suitable for various analytical and programmatic applications.

Columns

  • Title: The title of the television series.
  • IMDb ID: The unique identifier for the series on IMDb.
  • Release Year: The year in which the series was released.
  • Genre: The genre or genres associated with the series.
  • Cast: The main cast members of the series.
  • Synopsis: A brief summary or description of the series.
  • Rating: The average rating of the series on IMDb, scaled from 1 to 10.
  • Runtime: The duration of each episode or the total runtime of the series.
  • Certificate: The content rating or certificate assigned to the series (e.g., R, TV-MA).
  • Number of Votes: The total count of votes or ratings received by the series.
  • Gross Revenue: The total gross revenue generated by the series, if this information is available.

Distribution

The dataset is structured as individual CSV files, with each file corresponding to a specific genre of TV series. Examples include action_series.csv, adventure_series.csv, animation_series.csv, biography_series.csv, comedy_series.csv, crime_series.csv, documentary_series.csv, drama_series.csv, family_series.csv, fantasy_series.csv, history_series.csv, horror_series.csv, music_series.csv, musical_series.csv, mystery_series.csv, romance_series.csv, sci-fi_series.csv, sport_series.csv, superhero_series.csv, thriller_series.csv, war_series.csv, and western_series.csv.
Each file typically contains 11 columns. For instance, the action_series.csv file is approximately 3.99 MB. Most columns contain over 11,000 valid entries. However, certain columns have missing values: the Runtime column has around 9% missing values, Certificate has approximately 14% missing values, and Gross Revenue has a notable 77% missing values, indicating this information is frequently not present.

Usage

This dataset offers a wide array of ideal applications:
  • TV Series Analysis: Researchers, analysts, and enthusiasts can delve into the characteristics, trends, and patterns of television programmes across diverse genres.
  • Recommendation Systems: It is valuable for developing recommendation engines that suggest TV series based on user preferences, genre interests, or similar series.
  • Genre-based Analysis: Users can conduct comparative analysis, pinpoint popular genres, and investigate relationships between genres and other variables such as ratings, runtime, certificate, number of votes, or gross revenue.
  • Content Curation: Media organisations or streaming platforms can leverage this dataset to curate and recommend TV series to their audiences based on genre preferences, ratings, certificate, number of votes, or other relevant factors.
  • Machine Learning and Natural Language Processing (NLP): The dataset can be used for training machine learning models or for NLP tasks such as sentiment analysis, text classification, or text generation, utilising the synopsis and other textual features.

Coverage

The data primarily covers television programmes listed on IMDb. The Release Year column includes programmes released up to the year 2023, with a wide span of years represented, as indicated by 756 unique release years. There are no specific demographic details provided within the dataset description. Data availability varies across columns, with information like gross revenue frequently being absent.

License

CC0: Public Domain

Who Can Use It

  • Researchers and Analysts: For exploring trends and patterns in television programmes.
  • Data Scientists and Machine Learning Engineers: For building recommendation systems and training NLP models.
  • Media Companies and Streaming Platforms: For content curation and strategic decision-making.
  • TV Series Enthusiasts: For personal analysis and discovery of programme characteristics.

Dataset Name Suggestions

  • IMDb Television Series Data
  • Global TV Programme Metadata
  • Streaming Series Insights
  • Television Programme Analytics
  • IMDb Scraped TV Data

Attributes

Original Data Source: Global TV Programme Metadata

Listing Stats

VIEWS

1

DOWNLOADS

0

LISTED

08/09/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in ZIP Format