Bengali Cinema Metadata
Entertainment & Media Consumption
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset provides details on Bengali movies, designed to support the development of recommendation systems. It is a valuable resource for projects involving Natural Language Processing (NLP) and K-Nearest Neighbour (KNN) or Cosine Similarity algorithms. The data offers insights into film content, suitable for analysis in the entertainment and media consumption sectors.
Columns
- platform_Name: The name of the streaming platform where the movie is available (e.g., Hoichoi, Chorki).
- movieId: A unique identifier assigned to each film.
- title: The name of the Bengali movie.
- genres: Categories or types of the movie (e.g., Drama, Thriller).
- director: The individual or individuals responsible for directing the movie. Some entries may not have a director listed.
- starring: The main actors or actresses featured in the movie.
Distribution
The dataset is typically provided in a CSV file format. It contains details for approximately 381 unique Bengali movies. The structure includes various textual and categorical fields, with a focus on movie metadata for analytical purposes.
Usage
This dataset is ideally suited for:
- Building and evaluating movie recommendation systems, especially those leveraging K-Nearest Neighbour and Cosine Similarity.
- Undertaking Natural Language Processing (NLP) tasks related to movie titles, genres, or cast information.
- Data visualisation projects to explore trends in Bengali cinema.
- Academic research into streaming platform content and audience preferences within the Bengali film industry.
Coverage
The data is globally applicable, focusing specifically on Bengali movies. It includes content from prominent streaming platforms, with Hoichoi representing 57% of entries and Chorki accounting for 43%. The dataset encompasses various genres, directors, and starring actors, providing a snapshot of available film content. No specific time range or demographic scope beyond the Bengali language focus is detailed in the source material.
License
CC0
Who Can Use It
- Data scientists and machine learning engineers keen on developing recommendation algorithms.
- NLP researchers looking for a domain-specific text corpus.
- Academics and students from institutions like BRAC University conducting media studies or data science research.
- Analysts interested in content trends and distribution patterns on streaming platforms.
Dataset Name Suggestions
- Bengali Movie Content Dataset
- Streaming Bengali Film Data
- Bengali Cinema Metadata
- Movie Recommendation System Data (Bengali)
- BRAC University Bengali Movies
Attributes
Original Data Source: Bengali Movie Dataset