Movie Income and Description Dataset
Entertainment & Media Consumption
Free
About
This dataset provides a collection of movie records, each with a category, an income figure, and a textual description. The data was sourced from DBpedia using SPARQL. It is designed to support natural language processing (NLP) tasks, such as predicting film income from descriptions, and to facilitate the visualisation of film features.
Columns
- id: A unique identifier for each movie entry.
- film: The name of the movie, accompanied by its corresponding link to DBpedia for additional details.
- income: The revenue generated by the movie.
- cat: The specific category associated with the movie.
- desc: A textual description of the movie.
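As a minimal sketch of how the columns above might be read, the snippet below parses a CSV with the five listed column names using Python's standard `csv` module. The sample rows, the `load_movies` helper, and the assumption that income values parse as plain or scientific-notation numbers are all hypothetical; the real file's formatting may differ.

```python
import csv
import io

# Hypothetical sample rows that only mirror the documented column names.
# The formatting of the real file (for example, how the DBpedia link is
# stored in the film column) is not specified and may differ.
SAMPLE_CSV = """id,film,income,cat,desc
1,Example Film A,1200000.0,American films,A drama about a family.
2,Example Film B,,English-language films,A comedy set in London.
3,Example Film C,3.5E7,Other,An adventure across the desert.
"""


def load_movies(handle):
    """Read rows and convert income to float, or None if missing or unparsable."""
    movies = []
    for row in csv.DictReader(handle):
        try:
            income = float(row["income"])
        except (TypeError, ValueError):
            income = None
        movies.append(
            {
                "id": row["id"],
                "film": row["film"],
                "income": income,
                "cat": row["cat"],
                "desc": row["desc"],
            }
        )
    return movies


movies = load_movies(io.StringIO(SAMPLE_CSV))
```

To read an actual download, the `io.StringIO` object could be replaced with a file opened via `open(path, newline="", encoding="utf-8")`, assuming the file is UTF-8 encoded.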
Distribution
This dataset contains approximately 50,000 movie records. The data file is typically provided in CSV format. The file size is not specified. Income figures and categories vary widely across records.
Usage
This dataset is ideally suited for a variety of applications:
- Training NLP models: Particularly for tasks like predicting movie income based on textual descriptions.
- Feature visualisation: Exploring trends and characteristics within the film industry.
- Machine learning research: Developing and testing regression models and neural networks.
- Text analysis: Understanding the content and themes of movie descriptions.
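To illustrate the income-prediction use case, here is a deliberately simple bag-of-words baseline in pure Python. It is not a method from the dataset's authors: each token is scored by the mean log10 income of the training descriptions that contain it, and a new description is predicted as the average of its known token scores. The toy descriptions, income values, and function names are hypothetical, and the sketch assumes incomes are strictly positive.

```python
import math
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def fit_token_means(descriptions, incomes):
    """Return the global mean log10 income and a per-token mean log10 income.

    Incomes must be strictly positive because of the logarithm.
    """
    logs = [math.log10(value) for value in incomes]
    global_mean = sum(logs) / len(logs)
    totals = defaultdict(float)
    counts = defaultdict(int)
    for desc, log_income in zip(descriptions, logs):
        # Count each token once per description.
        for token in set(tokenize(desc)):
            totals[token] += log_income
            counts[token] += 1
    token_means = {token: totals[token] / counts[token] for token in totals}
    return global_mean, token_means


def predict_log_income(desc, global_mean, token_means):
    """Average the scores of known tokens; fall back to the global mean."""
    known = [token_means[t] for t in set(tokenize(desc)) if t in token_means]
    return sum(known) / len(known) if known else global_mean


# Hypothetical toy training data, not taken from the dataset.
train_desc = [
    "A superhero blockbuster sequel",
    "A quiet independent drama",
    "A superhero origin story",
    "An independent documentary",
]
train_income = [1e9, 1e6, 1e8, 1e5]

global_mean, token_means = fit_token_means(train_desc, train_income)
high = predict_log_income("superhero", global_mean, token_means)
low = predict_log_income("independent", global_mean, token_means)
```

Working in log space keeps a few very large incomes from dominating the averages. A regression model over TF-IDF features or a neural network, as mentioned in the list above, would be the natural next step beyond this baseline.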
Coverage
Coverage is global and spans a wide array of films. The dataset includes categories such as American films and English-language films, but a significant portion of the records fall under "Other" categories, which indicates a broad scope. No specific time range or demographic focus is documented.
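One way to check how much of the data falls under each category is to compute category shares from the cat column. The sketch below uses a small hypothetical list of category values and a hypothetical `category_shares` helper; the shares it produces say nothing about the real dataset.

```python
from collections import Counter

# Hypothetical values standing in for the cat column.
cats = ["American films", "English-language films", "Other", "Other"]


def category_shares(values):
    """Return each category's fraction of all records, most common first."""
    counts = Counter(values)
    total = sum(counts.values())
    return {cat: n / total for cat, n in counts.most_common()}


shares = category_shares(cats)
```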
License
CC0
Who Can Use It
This dataset is beneficial for:
- Data scientists and machine learning engineers interested in building predictive models, especially for income forecasting.
- Researchers in natural language processing and text analytics.
- Academics and students studying the entertainment industry or data science applications.
- Anyone seeking to visualise and analyse film-related metrics.
Dataset Name Suggestions
- Movie Income and Description Dataset
- Film Revenue and Textual Synopses
- Global Motion Picture Metrics
- Cinema Earnings and Details
Attributes
Original Data Source: FIlms Income & Description