Modi's Digital Speech Collection
Government & Civic Records
Free
About
This dataset is a digital archive of the text of speeches delivered by Narendra Damodardas Modi, the 14th and current Prime Minister of India, who has served since 2014. He was previously Chief Minister of Gujarat from 2001 to 2014. Modi is known for his oratorical skill and his ability to connect with ordinary citizens; this collection covers his public addresses from 2018 onwards. Most of the speeches are translated from Hindi into English, which makes them accessible to a wider audience. The dataset is useful for researchers and analysts interested in political discourse, government communication, and linguistic studies.
Columns
- id: A unique identifier for each speech record.
- url: The URL from which the speech data was originally scraped.
- title: The official title given to the speech or public engagement.
- article_text: The full text content of the speech.
- images: URLs linking to images taken during the speech event, predominantly featuring Narendra Modi.
- publish_info: Details regarding the publication of the speech, including the author, date, and time.
- tags: Keywords or tags assigned from the original scraped website, useful for categorising the speeches.
Distribution
The dataset is distributed as a CSV file. A sample file is available separately on the platform. It contains approximately 987 unique records (rows) covering speeches and public engagements. Every record uses the same set of columns, listed above.
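As a minimal sketch of reading the file, assuming the CSV header matches the column names listed under Columns, Python's standard `csv` module can load the records and check the schema. The function name `load_speeches` is illustrative and not part of the dataset.

```python
import csv
from typing import Dict, Iterable, List

# Column names as listed in the Columns section.
# Assumption: the CSV header uses exactly these names.
EXPECTED_COLUMNS = ["id", "url", "title", "article_text", "images", "publish_info", "tags"]


def load_speeches(lines: Iterable[str]) -> List[Dict[str, str]]:
    """Read speech records from an open CSV file (or any iterable of lines).

    Raises ValueError if any expected column is missing from the header.
    """
    reader = csv.DictReader(lines)
    header = reader.fieldnames or []
    missing = [name for name in EXPECTED_COLUMNS if name not in header]
    if missing:
        raise ValueError(f"CSV is missing expected columns: {missing}")
    # Each row becomes a dict keyed by column name; all values are strings.
    return list(reader)
```

To use it, open the downloaded file with `newline=""` and an explicit encoding (UTF-8 is an assumption here) and pass the file object to `load_speeches`. Opening with `newline=""` lets the `csv` module handle line breaks inside quoted speech text correctly.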
Usage
Typical applications include:
- Natural Language Processing (NLP) tasks, such as sentiment analysis, topic modelling, and language modelling.
- Analysing Narendra Modi's oratorical style, vocabulary, and recurring themes in his public addresses.
- Studying the evolution of government communication and policy messaging over time.
- Research into Indian politics, public opinion, and the impact of political discourse.
- Categorising speeches based on their content, context, or associated tags.
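To illustrate the vocabulary and recurring-theme analysis mentioned above, here is a minimal sketch that counts the most frequent words in the `article_text` column. It assumes each record is a dictionary keyed by column name (for example, a row produced by `csv.DictReader`). The function name `top_terms`, the small stop-word list, and the tokenisation rule are illustrative choices, not part of the dataset.

```python
import re
from collections import Counter
from typing import Dict, Iterable, List, Optional, Tuple

# A deliberately small, illustrative stop-word list;
# a real analysis would use a fuller one.
STOP_WORDS = {"the", "and", "of", "to", "a", "in", "is", "for", "that", "this", "we", "our", "i"}


def top_terms(records: Iterable[Dict[str, Optional[str]]], n: int = 10) -> List[Tuple[str, int]]:
    """Return the n most frequent words across all speeches as (word, count) pairs."""
    counts: Counter = Counter()
    for record in records:
        # Treat a missing or empty article_text as an empty speech.
        text = (record.get("article_text") or "").lower()
        # Simple tokenisation: runs of ASCII letters and apostrophes.
        words = re.findall(r"[a-z']+", text)
        counts.update(w for w in words if w not in STOP_WORDS and len(w) > 2)
    return counts.most_common(n)
```

Raw word counts are only a starting point. Because most of the speeches are translations, recurring terms reflect the translators' word choices as well as the speaker's.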
Coverage
The dataset's geographic scope is India: it covers public addresses by the country's Prime Minister. The time range begins in 2018. Most speeches are translated from Hindi into English, which makes them accessible to international users.
License
CC0
Who Can Use It
This free dataset is particularly useful for:
- Academic researchers in political science, linguistics, and social sciences.
- Data analysts and scientists performing NLP or textual analysis on political content.
- Journalists and media professionals seeking primary source material for reporting on Indian politics.
- Historians and archivists building collections of significant public records.
- Anyone interested in the speeches and public statements of a prominent world leader.
Dataset Name Suggestions
- Narendra Modi Speeches Archive
- Indian Prime Minister Modi's Public Addresses
- Modi's Digital Speech Collection
- Narendra Modi: Speeches and Engagements
Attributes
Original Data Source: Narendra Modi - Text Speeches