Opendatabay APP

Bengali Cinema Metadata

Entertainment & Media Consumption

Tags and Keywords

Movies

Tv

Shows

Data

Visualization

Nlp

K-means

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Bengali Cinema Metadata Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset provides details on Bengali movies, designed to support the development of recommendation systems. It is a valuable resource for projects involving Natural Language Processing (NLP) and K-Nearest Neighbour (KNN) or Cosine Similarity algorithms. The data offers insights into film content, suitable for analysis in the entertainment and media consumption sectors.

Columns

  • platform_Name: The name of the streaming platform where the movie is available (e.g., Hoichoi, Chorki).
  • movieId: A unique identifier assigned to each film.
  • title: The name of the Bengali movie.
  • genres: Categories or types of the movie (e.g., Drama, Thriller).
  • director: The individual or individuals responsible for directing the movie. Some entries may not have a director listed.
  • starring: The main actors or actresses featured in the movie.

Distribution

The dataset is typically provided in a CSV file format. It contains details for approximately 381 unique Bengali movies. The structure includes various textual and categorical fields, with a focus on movie metadata for analytical purposes.

Usage

This dataset is ideally suited for:
  • Building and evaluating movie recommendation systems, especially those leveraging K-Nearest Neighbour and Cosine Similarity.
  • Undertaking Natural Language Processing (NLP) tasks related to movie titles, genres, or cast information.
  • Data visualisation projects to explore trends in Bengali cinema.
  • Academic research into streaming platform content and audience preferences within the Bengali film industry.

Coverage

The data is globally applicable, focusing specifically on Bengali movies. It includes content from prominent streaming platforms, with Hoichoi representing 57% of entries and Chorki accounting for 43%. The dataset encompasses various genres, directors, and starring actors, providing a snapshot of available film content. No specific time range or demographic scope beyond the Bengali language focus is detailed in the source material.

License

CC0

Who Can Use It

  • Data scientists and machine learning engineers keen on developing recommendation algorithms.
  • NLP researchers looking for a domain-specific text corpus.
  • Academics and students from institutions like BRAC University conducting media studies or data science research.
  • Analysts interested in content trends and distribution patterns on streaming platforms.

Dataset Name Suggestions

  • Bengali Movie Content Dataset
  • Streaming Bengali Film Data
  • Bengali Cinema Metadata
  • Movie Recommendation System Data (Bengali)
  • BRAC University Bengali Movies

Attributes

Original Data Source: Bengali Movie Dataset

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

27/06/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in ZIP Format