Hindi Film Debut and Box Office Dataset
Product Reviews & Feedback
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
Detailed metrics for 1,698 Hindi-language films released within India spanning the years 2005 to 2017. The data captures essential attributes related to film production, market performance, and talent contributions. Key elements include financial metrics, such as revenue, budget, and screen count, alongside categorical variables detailing genre, franchise status, and whether the primary contributors (lead actors, directors, and music directors) were making their debut. This information provides a valuable foundation for examining trends and financial performance within the contemporary Bollywood industry.
Columns
The dataset contains 14 distinct fields, providing detailed information about each film:
- Movie_Name: The title of the released movie. The dataset contains 1,695 unique titles.
- Release_Period: Categorisation of the release time, indicating if the movie was released during a Normal period (63%) or a Holiday period (37%).
- Whether_Remake: Boolean indicating if the film is a recreation of a previously released movie (Yes/No). Only 4% of the films are remakes.
- Whether_Franchise: Specifies if the film belongs to a franchise or cinematic universe (Yes/No). 5% of the movies are part of a franchise.
- Genre: The primary category of the film. Drama is the most frequent genre (38%), followed by Comedy (17%). There are 14 unique genres identified.
- New_Actor: Indicates whether the lead actor is making a debut (True for 27% of records).
- New_Director: Indicates if the director is making their debut (True for 48% of records).
- New_Music_Director: Indicates if the music director is working on their first film (True for 33% of records).
- Lead_Star: The name of the primary actor or actress. There are 764 unique lead stars, with Akshay Kumar being the most frequent, appearing in 3% of the films.
- Director: The name of the movie's director. There are 1,048 unique directors listed.
- Music_Director: The name of the music director or composer. Pritam is the most frequent, appearing in 5% of the movies.
- Number_of_Screens: The total count of screens where the film was released. The average number of screens is 554.
- Revenue(INR): The total box office revenue generated in Indian Rupees (INR). The average revenue is 150 million.
- Budget(INR): The estimated production budget of the movie in Indian Rupees (INR). The average budget is 238 million.
Distribution
The data product is structured as a file named repository.csv, with a file size of approximately 186.86 kB. It contains 1,698 records, with 14 columns dedicated to movie details and metrics. Data validity stands at 100% across all listed columns. The expected update frequency for this dataset is never.
Usage
This dataset is ideal for:
- Exploratory Data Analysis: Investigating correlations between financial metrics (budget, revenue, screens) and release characteristics (genre, franchise status, release period).
- Data Visualization: Creating graphical representations of industry trends over the 2005-2017 period.
- Performance Analysis: Studying the success rates and financial outcomes associated with movies featuring debut talent (actors, directors, music directors).
- Market Research: Analysing the financial viability of different genres, remakes, or franchise entries within the Indian market.
Coverage
The data focuses exclusively on Hindi-language movies released within India. The temporal scope spans the period 2005 through 2017.
License
CC BY-SA 4.0
Who Can Use It
- Film Industry Analysts: To benchmark financial performance, screen allocations, and budget planning.
- Data Scientists/Statisticians: For building predictive models regarding box office success based on production attributes.
- Academic Researchers: Studying the evolution and trends of the Indian cinema market.
- Hobbyists: Interested in generating insights and visualising key statistics about Bollywood movies.
Dataset Name Suggestions
- Bollywood Movie Financial and Attribute Data (2005-2017)
- Indian Cinema Performance Metrics
- Hindi Film Debut and Box Office Dataset
Attributes
Original Data Source: Hindi Film Debut and Box Office Dataset
Loading...
