B99 Character Script Data
Product Reviews & Feedback
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
Dialogues and transcripts from the acclaimed American television comedy Brooklyn Nine-Nine, capturing content from seasons 1 through 4. This text-based data product was initially created to serve as the foundation for a Discord Bot, filling a gap in the availability of structured script data for the series. It offers a valuable resource for natural language processing (NLP) tasks and detailed character analysis.
Columns
- name: Identifies the character who speaks the accompanying line of dialogue. Data quality indicates a high percentage of lines spoken by 'JAKE', though many other character names are present.
- line: Contains the exact spoken dialogue or utterance corresponding to the character listed in the
namecolumn. This column features over 6,000 unique values.
Distribution
The data is delivered in a CSV format, specifically named
Brooklyn99_Season1-4_Dataset.csv, with a file size of approximately 464.55 kB. The structure consists of 2 columns and contains 6,460 valid records in total.Usage
Ideal applications include developing conversational AI models, training dialogue generation systems, performing sentiment analysis on character arcs, and conducting linguistic studies focused on scripted television speech patterns. It is also suitable for learning text processing techniques using programming libraries such as pandas.
Coverage
The content covers the transcripts spanning Season 1 through Season 4 of the Brooklyn 99 television series. It is important to note that certain episodes within this range are missing from the collection. The language is English, and the content is generally considered globally relevant.
License
CC0: Public Domain
Who Can Use It
- NLP Researchers: To study dialogue flow and character-specific language models.
- Bot Developers: For creating character-based automated chat or Q&A systems.
- Data Science Enthusiasts: For practicing text cleaning, tokenisation, and basic text mining techniques.
- Media Scholars: To quantitatively analyse character representation and screenwriting styles.
Dataset Name Suggestions
- Brooklyn Nine-Nine Dialogues S1-4
- B99 Character Script Data
- Television Dialogue Transcripts (Brooklyn 99)
- Brooklyn 99 Conversation Dataset
Attributes
Original Data Source: B99 Character Script Data
Loading...
