Shakespeare Dialogue Analysis Dataset
News & Media Articles
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset explores dialogue from several of William Shakespeare's notable plays, including Hamlet, Macbeth, and Romeo and Juliet. It provides a unique opportunity to delve into the textual fabric of these timeless works. The data originates from shakespeare.mit.edu, which has been offering Shakespeare's plays and poetry to the internet community since 1993, with this specific collection curated by Nicola Rennie. Its main purpose is to facilitate analysis of dialogue patterns, character speaking habits, and the presence of stage directions within these classic works.
Columns
- act: Represents the specific act number within a play.
- scene: Indicates the scene number for each line of dialogue or stage direction.
- character: Identifies the name of the character speaking or denotes if the entry is a stage direction.
- dialogue: Contains the actual text spoken by a character or the description of a stage direction.
- line_number: Provides a sequential number for each line of dialogue.
Distribution
The dataset is typically provided in a CSV file format. The provided sample,
hamlet.csv
, is approximately 296.03 KB in size and contains 5 columns. It includes around 4217 individual records or rows, making it a manageable size for detailed textual analysis.Usage
This dataset is ideal for various analytical tasks, such as:
- Determining which play features the highest proportion of stage directions relative to dialogue.
- Identifying the plays or characters with the longest lines of dialogue.
- Analysing character speaking frequency and dominance within scenes.
- Conducting textual analysis and natural language processing on historical literary texts.
- Exploring thematic patterns and linguistic nuances across Shakespeare's works.
Coverage
The geographic scope of the content pertains to the UK, and the time period covered is historical, aligning with the eras depicted in Shakespeare's plays. Specifically, the dataset focuses on dialogues from Hamlet, Macbeth, and Romeo and Juliet, offering a focused scope on these celebrated tragedies.
License
CC0: Public Domain
Who Can Use It
- Literary Researchers and Academics: To conduct in-depth textual analysis, study character development through dialogue, or explore structural elements of Shakespearean plays.
- Students: As an educational resource for understanding play structure, character interaction, and historical literary language.
- Theatre Practitioners: To gain insights into pacing, character emphasis, and stage instructions directly from the text.
- Data Scientists and NLP Enthusiasts: For applying natural language processing techniques to historical data, identifying linguistic patterns, and building text analysis models.
Dataset Name Suggestions
- Shakespearean Play Dialogues
- Hamlet Macbeth Romeo and Juliet Dialogue
- Classic British Play Transcripts
- Shakespeare Dialogue Analysis Dataset
- UK Historical Play Scripts
Attributes
Original Data Source: Shakespeare Dialogue Analysis Dataset