Opendatabay APP

Bankim Chandra Chattopadhyay Literary Works

Entertainment & Media Consumption

Tags and Keywords

Arts

Entertainment

Literature

Nlp

Languages

Bengali

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Bankim Chandra Chattopadhyay Literary Works Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset presents a collection of the complete literary works of Bankim Chandra Chattopadhyay, an influential Indian novelist, poet, essayist, and journalist. Chattopadhyay, also known as Sahitya Samrat (Emperor of Literature) in Bengali, is a significant figure in modern Bengali and Indian literature. His notable contributions include the 1882 Bengali novel Anandamath, considered a landmark work, and the composition of Vande Mataram. The latter, written in highly sanskritized Bengali, is renowned for personifying Bengal as a mother goddess and played an inspiring role for activists during the Indian Independence Movement. The collection includes fourteen novels and numerous serious, serio-comic, satirical, scientific, and critical treatises, offering a rich resource for deep exploration into Bengali literature.

Columns

  • name: The title or name of the specific literary work.
  • collection: Indicates the collection or larger grouping the work belongs to, if applicable.
  • genre: Specifies the literary genre of the work (e.g., novel, essay, poem).
  • content: The full textual content of the literary work, presented in Bengali.

Distribution

The dataset is provided in two primary formats: three CSV files and two TXT files. Each CSV file contains literary works organised by genre, with an additional all_collection.csv file combining works from all genres. The TXT files aggregate all literary works within a genre into a single text file. All content has undergone a basic preprocessing step to eliminate empty spaces and in-page titles, ensuring cleanliness. While specific numbers for rows or records are not available in the provided information, the structured CSV format is well-suited for literary analyses and comparative studies, whereas the aggregated TXT format is ideal for training various sequential models.

Usage

This dataset is well-suited for a variety of applications and uses, including:
  • Training various sequential models in natural language processing (NLP).
  • Literary analyses of Bankim Chandra Chattopadhyay's works.
  • Comparative studies within Bengali literature or across different literary traditions.
  • Exploring the depth and richness of Bengali literature for general literary experts.

Coverage

The dataset covers the complete works of Bankim Chandra Chattopadhyay, whose life spanned from 1838 to 1894. The literary content is in the Bengali language, reflecting the cultural and linguistic landscape of Bengal during his era. The scope is specifically focused on his individual literary output.

License

CC0

Who Can Use It

This dataset is an excellent resource for:
  • Machine learning enthusiasts looking for Bengali text data to train language models, perform sentiment analysis, or develop other NLP applications.
  • General literary experts interested in studying the works of a pivotal figure in Indian literature, conducting textual analysis, or understanding the nuances of 19th-century Bengali writing.
  • Researchers in digital humanities, cultural studies, and linguistics.

Dataset Name Suggestions

  • Bankim Chandra Chattopadhyay Literary Works
  • Bengali Literature Collection by Bankim Chandra
  • Bankim Chandra's Complete Works
  • Anandamath and Other Works by Bankim

Attributes

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

24/06/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in ZIP Format