Hillary Clinton Donald Trump Speech Transcripts
Data Science and Analytics
Free
About
This dataset is a small corpus of transcribed campaign speeches delivered by Hillary Clinton and Donald Trump in the run-up to the 2016 election. It is designed to support data science and analytics, particularly in the fields of text analysis, politics, and Natural Language Processing (NLP). The primary purpose is to enable users to analyse the speech patterns and content of either candidate or both.
Columns
- doc_id: This column contains a unique document identifier, which also embeds metadata such as the speaker and the date of the speech.
- text: This column holds the transcribed text of the speeches. All non-speaker data, including paralinguistic descriptions and speech from other individuals such as the master of ceremonies, are enclosed within angle brackets (<>). For accurate analysis of the candidates' speech, users should omit this bracketed content during processing.
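The bracket convention described above can be handled with a regular expression. The following is a minimal sketch, assuming annotations never nest and every opening bracket has a matching close; the function name strip_annotations and the sample string are illustrative and not taken from the dataset.

```python
import re

# Matches one angle-bracketed annotation, e.g. "<APPLAUSE>".
# Assumption: annotations do not nest and are always closed.
ANNOTATION = re.compile(r"<[^<>]*>")

def strip_annotations(text: str) -> str:
    """Remove bracketed non-speaker content and collapse leftover whitespace."""
    without = ANNOTATION.sub(" ", text)
    return " ".join(without.split())

# Hypothetical sample, not taken from the corpus.
sample = "Thank you. <APPLAUSE> Thank you all <MC: Please be seated> so much."
print(strip_annotations(sample))  # prints "Thank you. Thank you all so much."
```

Replacing each annotation with a space before collapsing whitespace keeps words on either side of a bracket from being fused together.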
Distribution
The dataset is provided as a .csv file. It represents a small corpus of data. While the exact number of rows or records is not specified, it contains two primary columns, doc_id and text, structured to facilitate easy parsing and analysis.
Usage
This dataset is ideal for various applications, including:
- Text analysis of political discourse.
- Linguistic studies focusing on campaign rhetoric.
- Developing and testing Natural Language Processing (NLP) models.
- Sentiment analysis of political speeches.
- Academic research into the 2016 US presidential election campaigns.
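As a starting point for the applications listed above, here is a rough sketch of a per-document word-frequency count using only the Python standard library. It assumes the file has a header row with doc_id and text columns as described in this listing; the helper names, the crude tokenizer, and the in-memory demo row are assumptions for illustration only.

```python
import csv
import io
import re
from collections import Counter

ANNOTATION = re.compile(r"<[^<>]*>")  # bracketed non-speaker content
TOKEN = re.compile(r"[a-z']+")        # crude tokenizer: lowercase letters and apostrophes

def word_counts(rows):
    """Map each doc_id to a Counter of tokens in its cleaned text."""
    counts = {}
    for row in rows:
        cleaned = ANNOTATION.sub(" ", row["text"]).lower()
        counts[row["doc_id"]] = Counter(TOKEN.findall(cleaned))
    return counts

def load_rows(path):
    """Read the CSV into a list of dicts keyed by the header row."""
    with open(path, newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))

# Tiny in-memory stand-in for the real file; the doc_id value is made up.
demo = io.StringIO('doc_id,text\nexample-1,"We will win. <APPLAUSE> We will."\n')
demo_counts = word_counts(csv.DictReader(demo))
print(demo_counts["example-1"].most_common(2))
```

To run this on the actual file, pass the result of load_rows (called with the path to the downloaded .csv) to word_counts in place of the demo reader.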
Coverage
The dataset covers transcribed speeches from Hillary Clinton and Donald Trump delivered during the period leading up to the 2016 United States election. The scope is specific to these two candidates and that particular election cycle.
License
CC-BY
Who Can Use It
This dataset is suitable for:
- Data scientists and data analysts for text mining and NLP projects.
- Researchers in political science, linguistics, and communication studies.
- Students undertaking projects on election campaigns or discourse analysis.
- Anyone interested in the raw textual data from the 2016 US presidential campaign speeches for analytical purposes.
Dataset Name Suggestions
- Clinton Trump 2016 Election Speeches
- US Presidential Campaign Corpus 2016
- Hillary Clinton Donald Trump Speech Transcripts
- 2016 US Election Campaign Text Data
Attributes
Original Data Source: Clinton/Trump Corpus