Opendatabay APP

Bengali Song Genre Corpus

News & Media Articles

Tags and Keywords

Bangla

Lyrics

Song

Music

Bengali

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Bengali Song Genre Corpus Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

A collection of over 4,000 Bangla song lyrics, titles, and categories, created to fill a gap in publicly available, updated resources, as previous datasets may no longer be maintained. The information was obtained using a script that successfully collected data from BanglaSongLyrics.com. The output is structured specifically as a CSV file, allowing developers and researchers to begin utilizing the resource immediately with basic commands.

Columns

  • title: Represents the name of the individual song. This column contains 4020 unique values.
  • category: Defines the genre of the music, which serves as a classification element. Examples of the 21 unique values include 'আধুনিক' and 'রবীন্দ্র সংগীত'.
  • lyrics: Contains the actual textual content of the song. This field has 4087 unique records.

Distribution

The data is provided in a standard CSV format and is identified as BanglaSongLyrics.csv. The file size is 5.5 MB. The structure consists of 3 columns and 4105 valid records. The data is expected to be updated on a quarterly basis.

Usage

The dataset is highly useful for various applications in Natural Language Processing (NLP) focused on the Bengali language. It is ideal for training machine learning models to classify text based on genre, performing linguistic analysis on Bengali poetic forms, or acting as a large corpus of categorized textual data for other academic or developmental projects.

Coverage

The scope is focused exclusively on Bangla song lyrics. The content is drawn from the public domain and categorized across 21 distinct genres or categories. The collection totals over 4,000 individual songs.

License

CC0: Public Domain

Who Can Use It

Linguistic Researchers: Individuals interested in the study of structural elements, vocabulary, and themes present in Bengali lyrical text. Data Scientists: Those building and refining NLP models, particularly for text classification or sentiment analysis on lyrical content. Application Developers: Individuals requiring a readily accessible and sizable body of categorized Bengali text data for language application development.

Dataset Name Suggestions

  1. Bangla Song Lyrics Corpus
  2. Categorized Bengali Music Texts
  3. 4000+ Bangla Song Lyrics with Genre
  4. Updated Bangla Song Lyric Data

Attributes

Original Data Source: Bengali Song Genre Corpus

Listing Stats

VIEWS

1

DOWNLOADS

0

LISTED

18/11/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Loading...

Free

Download Dataset in CSV Format