US HBO Max Shows and Films
News & Media Articles
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset provides a detailed listing of TV shows and movies available on HBO Max in May 2022, specifically covering content accessible in the United States. Its primary purpose is to enable the analysis of streaming content, offering insights into trends and facilitating the development of data-driven applications. The dataset is structured into two distinct files:
titles.csv
, which details over 3,000 unique titles with their attributes, and credits.csv
, which records over 66,000 cast and crew credits for these titles. It is an excellent resource for understanding the HBO Max catalogue and for various data science tasks.Columns
The dataset is split into two files, each with distinct columns:
titles.csv:
- id: The unique identifier for the title on JustWatch.
- title: The name of the movie or TV show.
- show_type: Categorises the content as either a TV show or a movie.
- description: A concise summary of the title's plot or subject.
- release_year: The year the content was originally released.
- age_certification: The age rating for the content.
- runtime: The duration in minutes for movies, or the average episode length for TV shows.
- genres: A list of genres associated with the title.
- production_countries: A list of countries involved in the title's production.
- seasons: The number of seasons available, applicable only to TV shows.
- imdb_id: The unique identifier for the title on IMDb.
- imdb_score: The user score from IMDb.
- imdb_votes: The total number of votes received on IMDb.
- tmdb_popularity: The popularity score from The Movie Database (TMDB).
- tmdb_score: The user score from TMDB.
credits.csv:
- person_ID: The unique identifier for the person (actor or director) on JustWatch.
- id: The identifier of the title associated with the credit.
- name: The real name of the actor or director.
- character_name: The name of the character portrayed (primarily for actors).
- role: Specifies the person's role, either ACTOR or DIRECTOR.
Distribution
The dataset is distributed as two separate CSV files:
titles.csv
and credits.csv
.
The credits.csv
file has a size of 3.38 MB.
It contains information on over 3,000 unique titles and over 66,000 credit entries. Specifically, the credits.csv
file holds 66.4k valid records. The character
column in credits.csv
has 7% missing values, while all other columns in credits.csv
are 100% valid.Usage
This dataset is ideal for:
- Developing content-based recommender systems utilising genres and descriptions.
- Identifying the main types of content available on the HBO Max streaming service.
- Conducting network analysis based on the cast and crew relationships within titles.
- Performing exploratory data analysis to uncover interesting patterns and insights into streaming content.
Coverage
- Geographic Scope: The data focuses solely on content available in the United States.
- Time Range: The dataset was compiled using data acquired in May 2022.
- Update Frequency: This dataset is expected to be updated annually.
License
CC0: Public Domain
Who Can Use It
- Data scientists and machine learning engineers interested in building recommendation engines.
- Researchers and analysts studying streaming service content strategies or audience behaviour.
- Students and hobbyists seeking a real-world dataset for practice in data cleaning, visualisation, and analysis.
- Anyone requiring structured data for media content insights and trend identification.
Dataset Name Suggestions
- HBO Max Content Catalogue (May 2022)
- HBO Max Streaming Data
- US HBO Max Shows and Films
- HBO Max Content Insights
Attributes
Original Data Source: US HBO Max Shows and Films