ErenJeager Dialogue Dataset
Entertainment & Media Consumption
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset is a collection of character dialogues from all seasons of the anime series "Attack on Titan". It is designed as a valuable resource for various natural language processing (NLP) tasks and sentiment analysis for researchers, data enthusiasts, and fans. The dataset features wide coverage across all episodes and seasons, capturing diverse characters and scenarios from the series in their original Japanese context. It is structured to include speaker names, episode/season information, and timestamp data, facilitating easy manipulation and analysis. Additionally, it offers sentiment annotations for selected dialogues, enabling exploration of emotional dynamics between characters. Multi-language support is also provided, with translations into several popular languages.
Columns
- Name: Represents the character's name who speaks the dialogue. Examples include Eren, Armin, and others.
- Line: Contains the actual dialogue spoken by the character.
Distribution
The dataset typically comes in a CSV file format and contains two primary columns. It encompasses an extensive collection of dialogues spanning all episodes and seasons of "Attack on Titan". Specific numbers for rows or records are not detailed in the available information.
Usage
This dataset is ideal for:
- Natural Language Processing (NLP): Training and evaluating various NLP models, including language generation, dialogue generation, sentiment analysis, and character-based language modelling.
- Sentiment Analysis: Leveraging sentiment annotations to analyse character emotions and shifts in attitudes during key moments.
- Conversational AI: Developing interactive chatbots or conversational agents based on "Attack on Titan" characters.
- Language Learning: Practising translation and understanding conversational nuances in different languages using its multi-language support.
Coverage
The dataset covers all seasons and episodes of the "Attack on Titan" anime series. The original dialogues are in Japanese, with translations available in several popular languages. Its scope is global, intended for use by a wide audience.
License
CC-BY-SA
Who Can Use It
- Researchers: For natural language processing tasks, sentiment analysis, language modelling, and emotion detection.
- Data Enthusiasts: For exploring and analysing character-based insights from the anime.
- Fans: To delve deeper into the series' dialogues and character dynamics.
- Developers: For building interactive chatbots or conversational AI based on the anime's characters.
- Language Learners: To practise translation and enhance understanding of conversational nuances across different languages.
Dataset Name Suggestions
- Attack on Titan Dialogue Dataset
- ErenJeager Dialogue Data
- Anime Character Dialogue Collection
- Attack on Titan NLP Data
- AOT Dialogue Analysis Set
Attributes
Original Data Source: ErenJeager