Bengali Song Genre Corpus
Free
About
A collection of more than 4,000 Bangla song lyrics with their titles and categories. It was created to fill a gap in publicly available, up-to-date resources, since earlier datasets may no longer be maintained. The data was collected by a script from BanglaSongLyrics.com. It is distributed as a single CSV file, so developers and researchers can load it with standard tools and start working with it immediately.
Columns
- title: The name of the song. This column contains 4020 unique values.
- category: The genre of the song, usable as a classification label. There are 21 unique values, for example 'আধুনিক' and 'রবীন্দ্র সংগীত'.
- lyrics: The full text of the song. This column contains 4087 unique values.
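The column layout above can be checked before any further work. The following is a minimal sketch using only the Python standard library. It assumes that the file is UTF-8 encoded and has a header row with exactly the names title, category, and lyrics; neither point is confirmed by this listing. The function names read_songs and unique_counts are hypothetical, and the in-memory sample uses placeholder text rather than real records.

```python
import csv
import io
from typing import Dict, Iterable, List

# Column names as described in this listing (assumed to match the CSV header).
EXPECTED_COLUMNS = ("title", "category", "lyrics")


def read_songs(lines: Iterable[str]) -> List[Dict[str, str]]:
    """Parse CSV text into a list of row dicts, checking the header first."""
    reader = csv.DictReader(lines)
    missing = [c for c in EXPECTED_COLUMNS if c not in (reader.fieldnames or [])]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    return list(reader)


def unique_counts(rows: List[Dict[str, str]]) -> Dict[str, int]:
    """Count distinct values per column, comparable to the figures listed above."""
    return {col: len({row[col] for row in rows}) for col in EXPECTED_COLUMNS}


# Placeholder sample; the quoted field shows that multi-line lyrics are kept intact.
sample_csv = (
    "title,category,lyrics\n"
    'Song A,আধুনিক,"first line\nsecond line"\n'
    "Song B,রবীন্দ্র সংগীত,placeholder text\n"
    "Song C,আধুনিক,placeholder text\n"
)
rows = read_songs(io.StringIO(sample_csv, newline=""))
counts = unique_counts(rows)
print(len(rows), counts)

# For the real file (encoding is an assumption):
# with open("BanglaSongLyrics.csv", newline="", encoding="utf-8") as f:
#     rows = read_songs(f)
```

Opening the file with newline="" matters here because lyrics are likely to span several lines inside a quoted field, and the csv module needs the raw line endings to parse those fields correctly.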
Distribution
The data is provided in standard CSV format as BanglaSongLyrics.csv. The file size is 5.5 MB. The structure consists of 3 columns and 4105 valid records. The data is expected to be updated on a quarterly basis.
Usage
The dataset supports Natural Language Processing (NLP) work on the Bengali language. Typical uses include training machine learning models to classify text by genre, analyzing Bengali poetic and lyrical forms, and serving as a corpus of categorized text for other academic or development projects.
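As an illustration of the genre classification use case, here is a minimal sketch of a multinomial Naive Bayes classifier over character trigrams, written with the standard library only. It is not the method used by the dataset's author, and a real project would more likely rely on an established library. Character n-grams are chosen here only because they need no Bengali tokenizer. The class name, function name, labels genre_a and genre_b, and the toy training texts are all hypothetical; with the real data, each pair would be (lyrics, category) taken from the CSV.

```python
import math
from collections import Counter, defaultdict
from typing import Dict, List, Set, Tuple


def char_ngrams(text: str, n: int = 3) -> List[str]:
    """Split text into overlapping character n-grams after collapsing whitespace."""
    text = " ".join(text.split())
    if len(text) < n:
        return [text] if text else []
    return [text[i:i + n] for i in range(len(text) - n + 1)]


class NaiveBayesGenreClassifier:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def __init__(self, n: int = 3) -> None:
        self.n = n
        self.class_counts: Counter = Counter()                        # documents per label
        self.feature_counts: Dict[str, Counter] = defaultdict(Counter)  # n-gram counts per label
        self.vocab: Set[str] = set()

    def fit(self, samples: List[Tuple[str, str]]) -> "NaiveBayesGenreClassifier":
        """Accumulate counts from (lyrics, category) pairs."""
        for lyrics, category in samples:
            self.class_counts[category] += 1
            grams = char_ngrams(lyrics, self.n)
            self.feature_counts[category].update(grams)
            self.vocab.update(grams)
        return self

    def predict(self, lyrics: str) -> str:
        """Return the label with the highest log posterior score."""
        if not self.class_counts:
            raise ValueError("classifier has not been fitted")
        total_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        # n-grams never seen in training are ignored.
        grams = [g for g in char_ngrams(lyrics, self.n) if g in self.vocab]
        best_label, best_score = "", -math.inf
        for label, doc_count in self.class_counts.items():
            counts = self.feature_counts[label]
            denom = sum(counts.values()) + vocab_size
            score = math.log(doc_count / total_docs)
            for g in grams:
                score += math.log((counts[g] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy data with hypothetical labels; replace with (lyrics, category) rows from the CSV.
train = [
    ("river boat river moon", "genre_a"),
    ("moon river night boat", "genre_a"),
    ("drum march flag drum", "genre_b"),
    ("flag march drum victory", "genre_b"),
]
model = NaiveBayesGenreClassifier().fit(train)
prediction = model.predict("boat on the river")
print(prediction)
```

With 21 categories and roughly 4,000 songs, some genres may have few examples, so it would be sensible to inspect the label distribution and hold out a test split before trusting any accuracy figure.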
Coverage
The scope is focused exclusively on Bangla song lyrics. The content is drawn from the public domain and categorized across 21 distinct genres or categories. The collection totals over 4,000 individual songs.
License
CC0: Public Domain
Who Can Use It
- Linguistic Researchers: Individuals interested in the study of structural elements, vocabulary, and themes present in Bengali lyrical text.
- Data Scientists: Those building and refining NLP models, particularly for text classification or sentiment analysis on lyrical content.
- Application Developers: Individuals requiring a readily accessible and sizable body of categorized Bengali text data for language application development.
Dataset Name Suggestions
- Bangla Song Lyrics Corpus
- Categorized Bengali Music Texts
- 4000+ Bangla Song Lyrics with Genre
- Updated Bangla Song Lyric Data
Attributes
Original Data Source: Bengali Song Genre Corpus
