Hotstar TV and Movie Dataset
Entertainment & Media Consumption
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset provides a detailed catalogue of television shows and movies available on Disney+ Hotstar, a leading Indian subscription video on-demand service [2]. Disney+ Hotstar, owned by Novi Digital Entertainment of Disney Star, integrates content from Disney Star's local networks, including films, TV series, live sports, and original programming [2]. It also features content licensed from third-parties such as HBO and Showtime [2]. Following Disney's acquisition of 21st Century Fox in 2019, the platform expanded in April 2020 to include original programming, films, and television series from major Disney brands like Walt Disney Studios, Pixar, Marvel Studios, Lucasfilm, and National Geographic [2]. The service quickly became a dominant streaming platform in India and also operates in Indonesia, Malaysia, and Thailand, combining local, third-party entertainment with the broader Disney+ library [2]. This dataset offers insights into the platform's content offerings and media consumption trends [2, 3].
Columns
- hotstar_id: A unique identifier for each TV show or movie [4, 5].
- title: The name of the TV show or movie [4, 5].
- description: A short summary describing the content [4, 5].
- genre: The genre classification of the content [4, 5].
- year: The release year of the content [4, 5].
- age_rating: The age rating assigned to the content, as listed [4, 5].
- running_time: The duration of a movie, measured in minutes [4, 5].
- season: The total number of seasons for a TV show [4, 5].
- episodes: The total number of episodes for a TV show [4, 5].
- type: Indicates whether the content is a TV show or a movie [4, 5].
Distribution
This dataset comprises 6,245 unique TV shows and movies, each described by 10 distinct attributes [4]. The content spans release years from 1928 to 2023 [6]. Analysis of the content reveals that Drama accounts for 30% of the entries, Comedy for 12%, with other genres making up the remaining 59% [6]. The age ratings are predominantly U/A 13+ (43%) and U (18%) [6]. For movies, running times range from 1 to 229 minutes, with a notable concentration between 115.00 and 137.80 minutes [7]. The dataset is composed of 66% movies and 34% TV shows [7]. The typical data file format is CSV [8].
Usage
This dataset is ideal for a variety of applications and use cases [1]:
- Analysing content trends and genre popularity on streaming platforms [3].
- Developing and evaluating recommender systems for media content [3].
- Conducting market research on entertainment and media consumption [2, 3].
- Performing Natural Language Processing (NLP) tasks using content descriptions and titles [3].
- Studying content distribution across different age ratings and release years.
- Understanding the content catalogue of a major over-the-top streaming service [2].
Coverage
The geographic scope of the content primarily pertains to India, given Disney+ Hotstar's origins and primary market [2]. However, the service also operates in Indonesia, Malaysia, and Thailand [2]. Hotstar additionally targets overseas Indian audiences in markets such as Singapore, Canada, and the United Kingdom, although it operates as a service distinct from Disney+ in these regions [2]. The dataset includes content released between 1928 and 2023 [6]. Demographic scope is addressed through various age ratings assigned to the content, such as U/A 13+, U, U/A 16+, A, and U/A 7+ [6, 7]. The listed region for the dataset is GLOBAL [9].
License
CC-BY-SA
Who Can Use It
This dataset is suitable for a wide range of users and their specific needs [1]:
- Data Scientists and Analysts: To perform statistical analysis, identify content trends, and build predictive models within the entertainment sector.
- Academics and Researchers: To study media consumption patterns, content strategies, and the cultural impacts of streaming services.
- Developers: For building and enhancing content recommendation engines or AI models aimed at media understanding and content generation [3, 9].
- Content Creators and Producers: To gain insights into popular genres, themes, and audience preferences for new productions.
- Marketplace Users: Seeking data related to entertainment and media consumption, specifically movies and TV shows [3].
Dataset Name Suggestions
- Disney+ Hotstar Content Catalogue
- Hotstar TV and Movie Data
- Streaming Media Catalogue (Hotstar)
- Indian Streaming Entertainment Database
- Disney+ Hotstar Programme Inventory
Attributes
Original Data Source: Disney+ Hotstar Tv and Movie Catalog