Entertainment Movie Metrics
News & Media Articles
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset provides detailed information on the top 10,000 movies sourced from IMDb, serving as a valuable resource for various forms of film analysis and research. It covers a significant historical span, encompassing movies released between 1915 and 2023, and includes key attributes such as user ratings, financial performance, and cast details. The data was meticulously gathered through web scraping for educational and research purposes, offering insights into cinema history, entertainment industry trends, and audience reception. It is structured to support a range of analytical applications, from industry studies to academic investigations.
Columns
The dataset features 12 distinct columns, each providing specific details about the movies:
- movie_name: The title of the movie.
- year: The release year of the movie.
- rating: The IMDb user rating.
- metascore: The Metascore rating.
- gross_income: The gross income generated by the movie.
- votes: The total number of votes received on IMDb.
- runtime: The duration of the movie in minutes.
- genre: The genre or genres associated with the movie.
- certificate: The certification or rating of the movie.
- description: A brief summary or plot outline of the movie.
- directors: The director(s) of the movie.
- stars: The main cast or actors featured in the movie.
Distribution
The dataset is delivered in CSV format and occupies 2MB of space. It comprises 10,000 rows (records) and 12 columns, designed for straightforward import and analysis.
Usage
This dataset is ideally suited for a variety of applications and use cases, including:
- Film industry analysis, to understand market dynamics and performance.
- Sentiment analysis, by examining user ratings and reviews to gauge public opinion.
- Recommender system development, to build personalised movie suggestions for users.
- Trend analysis, to observe changes in movie attributes and popularity over different decades.
- Academic research and case studies related to movies, entertainment, and cultural studies.
- General data analysis and natural language processing (NLP) applications utilising movie descriptions and metadata.
Coverage
The dataset spans movie releases from 1915 to 2023, providing a historical perspective on cinema. While primarily focused on films and entertainment relevant to the American population, it also includes information on movies involving various countries and languages, reflecting the global nature of the film industry.
License
CC0 Public Domain
Who Can Use It
This dataset is intended for a broad audience of users, including:
- Film industry professionals for market research, content strategy, and competitive analysis.
- Data scientists and analysts developing predictive models, exploring movie trends, or creating visualisations.
- Academics and researchers conducting studies on film history, cultural impact, or data science methodologies.
- Developers creating new entertainment applications, recommendation engines, or data-driven platforms.
- Anyone with an interest in the history and evolution of movies and the entertainment sector.
Dataset Name Suggestions
- IMDb Top 10,000 Movies Data
- Historical IMDb Movie Dataset
- Global Film Ratings and Details
- Cinema Data Collection
- Entertainment Movie Metrics
Attributes
Original Data Source: Entertainment Movie Metrics