Persian Poetic Works Dataset
Entertainment & Media Consumption
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset is a substantial collection of Persian modernist poems, gathered from Iranian contemporary poets. It serves as a large corpus of over 4,000 poems, originally web scraped from shereno.com. Each poem is accompanied by its key metadata, including the poem's title, the name of the poet, and the book in which the poem was published. This resource offers a rich body of work for analysis and study within the realm of modern Persian literature.
Columns
- Poem: Contains the complete text of the poems.
- Poet: Specifies the name of the poet who authored the poem. There are 4,397 unique poet values, with notable contributors such as قاسم حسن نژاد (9%) and منوچهر آتشی (8%). The remaining 83% are attributed to 3,647 other unique poets.
- Title: Provides the title of the poem. This column features 3,967 unique titles, including examples like اولین غم و آخرین نگاه (2%) and لحظه ها و صحنه ها (2%), with 95% of titles attributed to 4,200 other unique entries.
- Book: Indicates the name of the book in which the poem was published.
Distribution
The dataset is typically provided as a CSV data file. It encompasses over 4,000 modernist poems, making it a sizable corpus. While a precise number of rows or records is not specified, the unique counts for poets and titles indicate its extensive nature.
Usage
This dataset is ideal for a variety of applications, including:
- Natural Language Processing (NLP) research and development.
- Literary analysis of Persian modernist poetry.
- Training and evaluating Large Language Models (LLMs) on specific linguistic and cultural contexts.
- Academic research in Iranian literature and cultural studies.
- Developing applications related to text generation, sentiment analysis, or topic modelling within the domain of poetry.
Coverage
The dataset focuses on Persian modernist poetry specifically from Iranian contemporary poets. Its content is globally available, making it accessible for researchers and developers worldwide. The scope is primarily linguistic and cultural, without specific notes on data availability for particular demographic groups or narrow time ranges beyond "contemporary."
License
CC0
Who Can Use It
This dataset is suitable for:
- Researchers and academics studying Persian literature, poetry, and linguistics.
- Data scientists and NLP engineers working on text analysis, language models, and content generation.
- Developers creating applications that require a rich corpus of poetic text.
- Cultural enthusiasts and students interested in gaining deeper insights into Iranian contemporary poetry.
Dataset Name Suggestions
- Shereno Persian Modernist Poetry
- Iranian Contemporary Poetry Corpus
- Persian Poetic Works Dataset
- Modern Persian Verse Collection
- Shereno: A Collection of Persian Poems
Attributes
Original Data Source: Shereno: A Dataset of Persian Modernist Poetry