Opendatabay APP

Entertainment Movie Metrics

News & Media Articles

Tags and Keywords

Movies

Imdb

Film

Ratings

Entertainment

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Entertainment Movie Metrics Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset provides detailed information on the top 10,000 movies sourced from IMDb, serving as a valuable resource for various forms of film analysis and research. It covers a significant historical span, encompassing movies released between 1915 and 2023, and includes key attributes such as user ratings, financial performance, and cast details. The data was meticulously gathered through web scraping for educational and research purposes, offering insights into cinema history, entertainment industry trends, and audience reception. It is structured to support a range of analytical applications, from industry studies to academic investigations.

Columns

The dataset features 12 distinct columns, each providing specific details about the movies:
  • movie_name: The title of the movie.
  • year: The release year of the movie.
  • rating: The IMDb user rating.
  • metascore: The Metascore rating.
  • gross_income: The gross income generated by the movie.
  • votes: The total number of votes received on IMDb.
  • runtime: The duration of the movie in minutes.
  • genre: The genre or genres associated with the movie.
  • certificate: The certification or rating of the movie.
  • description: A brief summary or plot outline of the movie.
  • directors: The director(s) of the movie.
  • stars: The main cast or actors featured in the movie.

Distribution

The dataset is delivered in CSV format and occupies 2MB of space. It comprises 10,000 rows (records) and 12 columns, designed for straightforward import and analysis.

Usage

This dataset is ideally suited for a variety of applications and use cases, including:
  • Film industry analysis, to understand market dynamics and performance.
  • Sentiment analysis, by examining user ratings and reviews to gauge public opinion.
  • Recommender system development, to build personalised movie suggestions for users.
  • Trend analysis, to observe changes in movie attributes and popularity over different decades.
  • Academic research and case studies related to movies, entertainment, and cultural studies.
  • General data analysis and natural language processing (NLP) applications utilising movie descriptions and metadata.

Coverage

The dataset spans movie releases from 1915 to 2023, providing a historical perspective on cinema. While primarily focused on films and entertainment relevant to the American population, it also includes information on movies involving various countries and languages, reflecting the global nature of the film industry.

License

CC0 Public Domain

Who Can Use It

This dataset is intended for a broad audience of users, including:
  • Film industry professionals for market research, content strategy, and competitive analysis.
  • Data scientists and analysts developing predictive models, exploring movie trends, or creating visualisations.
  • Academics and researchers conducting studies on film history, cultural impact, or data science methodologies.
  • Developers creating new entertainment applications, recommendation engines, or data-driven platforms.
  • Anyone with an interest in the history and evolution of movies and the entertainment sector.

Dataset Name Suggestions

  • IMDb Top 10,000 Movies Data
  • Historical IMDb Movie Dataset
  • Global Film Ratings and Details
  • Cinema Data Collection
  • Entertainment Movie Metrics

Attributes

Original Data Source: Entertainment Movie Metrics

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

07/08/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in ZIP Format