Classic Bengali Novels and Short Stories Dataset
Entertainment & Media Consumption
Free
About
The Complete Works of Sarat Chandra Chattopadhyay dataset collects the literary works of the renowned Bengali novelist and short story writer (1876–1938). His writings depict the lifestyle, tragedies, and struggles of village inhabitants, along with the social customs prevalent in Bengal during the early 20th century. Chattopadhyay remains a widely popular Indian author whose works continue to be translated and adapted.
Columns
- name: Name of the literary item.
- collection: Collection to which the literary item belongs.
- genre: Genre of the literary item.
- content: Formatted content of the literary item.
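As a rough illustration of working with these four columns, the sketch below reads a CSV file using only the Python standard library and counts works per genre. The function names `load_works` and `count_by_genre` are hypothetical helpers, not part of the dataset, and UTF-8 encoding is an assumption. The field size limit is raised because novel-length `content` values may exceed the `csv` module's default per-field limit.

```python
import csv
from collections import Counter

# Novel-length "content" fields can exceed the csv module's default
# per-field limit (131072 characters), so raise it before reading.
csv.field_size_limit(2**31 - 1)

# Column names as listed in the dataset description.
EXPECTED_COLUMNS = ["name", "collection", "genre", "content"]


def load_works(path):
    """Read one CSV file into a list of dicts keyed by column name."""
    # Assumption: the files are UTF-8 encoded (usual for Bengali text).
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        header = reader.fieldnames or []
        missing = [c for c in EXPECTED_COLUMNS if c not in header]
        if missing:
            raise ValueError(f"missing columns: {missing}")
        return list(reader)


def count_by_genre(works):
    """Count how many works carry each genre label."""
    return Counter(row["genre"] for row in works)
```

Assuming the layout described under Distribution, a call such as `load_works("csv/all_collection.csv")` followed by `count_by_genre(...)` would give a quick overview of the corpus; the exact genre labels in the files are not documented here.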
Distribution
This free dataset comprises five files: three CSV files located in the /csv directory and two TXT files in the /txt directory. Each CSV file contains literary works organised by genre, with an all_collection.csv file combining works from all genres. Each TXT file presents aggregated literary works by genre. All content has undergone basic preprocessing to remove empty spaces and in-page titles. The TXT formats are ideal for training various sequential models, whilst the CSV formats are suitable for literary analyses and comparative studies. The dataset is version 1.0 and has a quality rating of 5 out of 5.
Usage
- Training sequential models using the TXT formats.
- Conducting literary analyses using the CSV formats.
- Performing comparative studies of literary works using the CSV formats.
- Exploring Bengali literature through machine learning techniques.
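For the sequential-model use case, one common preparation step is to turn raw text into fixed-length input/target windows for next-character prediction. The following is a minimal sketch of that idea, not a prescribed pipeline; `build_vocab`, `encode`, and `make_windows` are hypothetical names, and character-level tokenisation is only one of several reasonable choices.

```python
def build_vocab(text):
    """Map each distinct character to an integer id (sorted for determinism)."""
    chars = sorted(set(text))
    return {ch: i for i, ch in enumerate(chars)}


def encode(text, vocab):
    """Convert a string into a list of integer ids using the vocabulary."""
    return [vocab[ch] for ch in text]


def make_windows(ids, seq_len):
    """Yield (input, target) pairs; the target is the input shifted by one."""
    for start in range(len(ids) - seq_len):
        window_in = ids[start:start + seq_len]
        window_out = ids[start + 1:start + seq_len + 1]
        yield window_in, window_out
```

Note that Python iterates over Unicode code points, so Bengali vowel signs and conjunct components become separate tokens under this scheme. In practice the text would be read from one of the TXT files (for example with `open(path, encoding="utf-8").read()`, assuming UTF-8), and the resulting windows fed to whichever model framework is in use.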
Coverage
The dataset covers the literary works of Sarat Chandra Chattopadhyay (1876–1938), an author active in the early 20th century. The works focus primarily on social practices and everyday life in the villages of Bengal.
License
CC0
Who Can Use It
- Machine learning enthusiasts interested in natural language processing and sequential model training.
- Literary experts and researchers for in-depth analyses of Bengali literature.
- Academics and students studying social history and literary trends in early 20th-century Bengal.
Dataset Name Suggestions
- Sarat Chandra Chattopadhyay: Complete Literary Works
- Bengali Literature Collection by Sarat Chandra
- Sarat Chandra Chattopadhyay Corpus
- Classic Bengali Novels and Short Stories Dataset
Attributes
Original Data Source: Complete Works of Sarat Chandra Chattopadhyay