Bankim Chandra Chattopadhyay Literary Works
Entertainment & Media Consumption
Free
About
This dataset collects the complete literary works of Bankim Chandra Chattopadhyay, an influential Indian novelist, poet, essayist, and journalist. Chattopadhyay, known in Bengali as Sahitya Samrat (Emperor of Literature), is a significant figure in modern Bengali and Indian literature. His notable contributions include the 1882 Bengali novel Anandamath, considered a landmark work, and the composition of Vande Mataram. The latter, written in highly Sanskritized Bengali, personifies Bengal as a mother goddess and inspired activists during the Indian Independence Movement. The collection includes fourteen novels and numerous serious, serio-comic, satirical, scientific, and critical treatises, making it a rich resource for exploring Bengali literature.
Columns
- name: The title or name of the specific literary work.
- collection: Indicates the collection or larger grouping the work belongs to, if applicable.
- genre: Specifies the literary genre of the work (e.g., novel, essay, poem).
- content: The full textual content of the literary work, presented in Bengali.
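As a rough illustration of how the four columns above might be read, the following Python sketch parses a CSV with that header and groups titles by genre. The sample rows, the helper names load_works and titles_by_genre, and the raised field-size limit are assumptions made for illustration; they are not taken from the dataset itself.

```python
import csv
import io
from collections import defaultdict

# Full-length novels in `content` may exceed the csv module's default
# per-field limit, so raise it (the value chosen here is an assumption).
csv.field_size_limit(10**9)

EXPECTED_COLUMNS = ["name", "collection", "genre", "content"]


def load_works(fileobj):
    """Read rows into dicts, checking that the documented columns exist."""
    reader = csv.DictReader(fileobj)
    missing = set(EXPECTED_COLUMNS) - set(reader.fieldnames or [])
    if missing:
        raise ValueError(f"missing columns: {sorted(missing)}")
    return list(reader)


def titles_by_genre(rows):
    """Map each genre to the list of work titles filed under it."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row["genre"]].append(row["name"])
    return dict(grouped)


# Placeholder rows standing in for a real file; every value is invented.
SAMPLE = io.StringIO(
    "name,collection,genre,content\n"
    "Work A,,novel,নমুনা পাঠ ১\n"
    "Work B,Collection X,essay,নমুনা পাঠ ২\n"
    "Work C,,novel,নমুনা পাঠ ৩\n"
)

rows = load_works(SAMPLE)
print(titles_by_genre(rows))
```

With a downloaded file, the same function could be called on open("all_collection.csv", encoding="utf-8", newline=""), assuming the file is UTF-8 encoded.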
Distribution
The dataset is provided in two formats: three CSV files and two TXT files. Each CSV file contains literary works organised by genre, with an additional all_collection.csv file combining works from all genres. Each TXT file aggregates all literary works within a genre into a single text file. All content has undergone basic preprocessing to remove empty spaces and in-page titles. Row and record counts are not specified. The structured CSV format is well suited to literary analyses and comparative studies, whereas the aggregated TXT format is better suited to training sequential models.
Usage
This dataset is well-suited for a variety of applications and uses, including:
- Training various sequential models in natural language processing (NLP).
- Literary analyses of Bankim Chandra Chattopadhyay's works.
- Comparative studies within Bengali literature or across different literary traditions.
- Exploration of the depth and richness of Bengali literature by literary scholars and general readers.
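For the sequence-model use case listed above, one common preparation step is to turn a long text into fixed-length input and target windows for next-character prediction. The sketch below shows that step in plain Python. The function names, the window length, and the short sample string are assumptions for illustration, not something the dataset prescribes.

```python
def build_vocab(text):
    """Assign an integer id to each distinct character (Unicode code point)."""
    return {ch: i for i, ch in enumerate(sorted(set(text)))}


def encode(text, vocab):
    """Convert text to a list of integer ids using the vocabulary."""
    return [vocab[ch] for ch in text]


def make_windows(ids, seq_len):
    """Build (input, target) pairs; each target is its input shifted by one."""
    pairs = []
    for start in range(len(ids) - seq_len):
        inputs = ids[start:start + seq_len]
        targets = ids[start + 1:start + seq_len + 1]
        pairs.append((inputs, targets))
    return pairs


# A short placeholder string; in practice this would be the contents of one
# of the aggregated TXT files.
text = "বন্দে মাতরম্"
vocab = build_vocab(text)
ids = encode(text, vocab)
pairs = make_windows(ids, seq_len=4)
print(len(vocab), len(pairs))
```

Python iterates strings by code point, so Bengali conjuncts and vowel signs are split into separate symbols here. Whether that granularity is appropriate, or whether a subword tokenizer would serve better, depends on the model being trained.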
Coverage
The dataset covers the complete works of Bankim Chandra Chattopadhyay, who lived from 1838 to 1894. The content is in Bengali and reflects the cultural and linguistic landscape of Bengal during his era. The scope is limited to his individual literary output.
License
CC0
Who Can Use It
This dataset is an excellent resource for:
- Machine learning enthusiasts looking for Bengali text data to train language models, perform sentiment analysis, or develop other NLP applications.
- General literary experts interested in studying the works of a pivotal figure in Indian literature, conducting textual analysis, or understanding the nuances of 19th-century Bengali writing.
- Researchers in digital humanities, cultural studies, and linguistics.
Dataset Name Suggestions
- Bankim Chandra Chattopadhyay Literary Works
- Bengali Literature Collection by Bankim Chandra
- Bankim Chandra's Complete Works
- Anandamath and Other Works by Bankim
Attributes
Original Data Source: Complete Works of Bankim Chandra Chattopadhyay