Opendatabay APP

Classic Bengali Novels and Short Stories Dataset

Entertainment & Media Consumption

Tags and Keywords

Arts

And

Entertainment

Literature

Nlp

Languages

Bengali

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Classic Bengali Novels and Short Stories Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

The Complete Works of Sarat Chandra Chattopadhyay dataset offers literary works by the renowned Bengali novelist and short story writer (1876–1938). His writings illuminate the lifestyle, tragedies, and struggles of village inhabitants, alongside contemporary social customs prevalent in Bengal during the early 20th century. Chattopadhyay remains a widely popular, translated, and adapted Indian author.

Columns

  • name: Name of the literary item.
  • collection: Collection to which the literary item belongs.
  • genre: Genre of the literary item.
  • content: Formatted content of the literary item.

Distribution

This free dataset comprises five files: three CSV files located in the /csv directory and two TXT files in the /txt directory. Each CSV file contains literary works organised by genre, with an all_collection.csv file combining works from all genres. Each TXT file presents aggregated literary works by genre. All content has undergone basic preprocessing to remove empty spaces and in-page titles. The TXT formats are ideal for training various sequential models, whilst the CSV formats are suitable for literary analyses and comparative studies. The dataset is version 1.0 and has a quality rating of 5 out of 5.

Usage

  • Training sequential models using the TXT formats.
  • Conducting literary analyses using the CSV formats.
  • Performing comparative studies of literary works using the CSV formats.
  • Exploring Bengali literature through machine learning techniques.

Coverage

The dataset covers the literary works of Sarat Chandra Chattopadhyay, an author active in the early 20th century (1876–1938). The context of the works primarily focuses on the social practices and life in Bengal villages.

License

CC0

Who Can Use It

  • Machine learning enthusiasts interested in natural language processing and sequential model training.
  • Literary experts and researchers for in-depth analyses of Bengali literature.
  • Academics and students studying social history and literary trends in early 20th-century Bengal.

Dataset Name Suggestions

  • Sarat Chandra Chattopadhyay: Complete Literary Works
  • Bengali Literature Collection by Sarat Chandra
  • Sarat Chandra Chattopadhyay Corpus
  • Classic Bengali Novels and Short Stories Dataset

Attributes

Listing Stats

VIEWS

1

DOWNLOADS

1

LISTED

27/06/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in ZIP Format