Global TV Programme Metadata
Product Reviews & Feedback
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
Information about television programmes from IMDb, including details such as title, unique IMDb identifier, release year, genre, main cast members, and a brief synopsis. Additionally, the data covers average IMDb ratings (scaled from 1 to 10), episode or series runtime, content certificates, the total number of votes received, and, where available, gross revenue. This data is gathered using web scraping techniques from the IMDb website and is organised into separate CSV files for each genre, making it suitable for various analytical and programmatic applications.
Columns
- Title: The title of the television series.
- IMDb ID: The unique identifier for the series on IMDb.
- Release Year: The year in which the series was released.
- Genre: The genre or genres associated with the series.
- Cast: The main cast members of the series.
- Synopsis: A brief summary or description of the series.
- Rating: The average rating of the series on IMDb, scaled from 1 to 10.
- Runtime: The duration of each episode or the total runtime of the series.
- Certificate: The content rating or certificate assigned to the series (e.g., R, TV-MA).
- Number of Votes: The total count of votes or ratings received by the series.
- Gross Revenue: The total gross revenue generated by the series, if this information is available.
Distribution
The dataset is structured as individual CSV files, with each file corresponding to a specific genre of TV series. Examples include
action_series.csv
, adventure_series.csv
, animation_series.csv
, biography_series.csv
, comedy_series.csv
, crime_series.csv
, documentary_series.csv
, drama_series.csv
, family_series.csv
, fantasy_series.csv
, history_series.csv
, horror_series.csv
, music_series.csv
, musical_series.csv
, mystery_series.csv
, romance_series.csv
, sci-fi_series.csv
, sport_series.csv
, superhero_series.csv
, thriller_series.csv
, war_series.csv
, and western_series.csv
.Each file typically contains 11 columns. For instance, the
action_series.csv
file is approximately 3.99 MB. Most columns contain over 11,000 valid entries. However, certain columns have missing values: the Runtime
column has around 9% missing values, Certificate
has approximately 14% missing values, and Gross Revenue
has a notable 77% missing values, indicating this information is frequently not present.Usage
This dataset offers a wide array of ideal applications:
- TV Series Analysis: Researchers, analysts, and enthusiasts can delve into the characteristics, trends, and patterns of television programmes across diverse genres.
- Recommendation Systems: It is valuable for developing recommendation engines that suggest TV series based on user preferences, genre interests, or similar series.
- Genre-based Analysis: Users can conduct comparative analysis, pinpoint popular genres, and investigate relationships between genres and other variables such as ratings, runtime, certificate, number of votes, or gross revenue.
- Content Curation: Media organisations or streaming platforms can leverage this dataset to curate and recommend TV series to their audiences based on genre preferences, ratings, certificate, number of votes, or other relevant factors.
- Machine Learning and Natural Language Processing (NLP): The dataset can be used for training machine learning models or for NLP tasks such as sentiment analysis, text classification, or text generation, utilising the synopsis and other textual features.
Coverage
The data primarily covers television programmes listed on IMDb. The
Release Year
column includes programmes released up to the year 2023, with a wide span of years represented, as indicated by 756 unique release years. There are no specific demographic details provided within the dataset description. Data availability varies across columns, with information like gross revenue frequently being absent.License
CC0: Public Domain
Who Can Use It
- Researchers and Analysts: For exploring trends and patterns in television programmes.
- Data Scientists and Machine Learning Engineers: For building recommendation systems and training NLP models.
- Media Companies and Streaming Platforms: For content curation and strategic decision-making.
- TV Series Enthusiasts: For personal analysis and discovery of programme characteristics.
Dataset Name Suggestions
- IMDb Television Series Data
- Global TV Programme Metadata
- Streaming Series Insights
- Television Programme Analytics
- IMDb Scraped TV Data
Attributes
Original Data Source: Global TV Programme Metadata