Breaking Bad Series Connections Dataset
Entertainment & Media Consumption
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset is specifically created for the network analysis of the Breaking Bad television series. Its primary purpose is to address the current unavailability of a dedicated relationship dataset for the show, providing generated data from episode summaries suitable for graph network analysis. The project draws inspiration from DataCamp's pioneering work on network analysis for Game of Thrones [1].
Columns
The dataset contains information detailing characters present throughout the series, collected on a season-by-season basis. The data is available in an uncleaned version [2].
- Season: Denotes the specific season identifier for which the data is provided [2].
- Season name: Specifies the corresponding name of the season [2].
- Characters: Lists characters found within the series, acting as nodes for network analysis [2].
- Actors Name and Character name: Provides the name of the actor alongside the character they portray, aiding in precise identification [2]. The data highlights character distribution, indicating, for example, that 22% of characters are in Season 4 and 21% in Season 3, with 57% designated as 'Other' across seasons. Furthermore, Bryan Cranston as Walter White and Aaron Paul as Jesse Pinkman each represent 5% of a particular character distribution, while 90% fall under 'Other' characters [2].
Distribution
The dataset is structured to facilitate network analysis, with information gathered per season [1, 2]. While specific counts for rows or records are not detailed in the provided sources, it is common for such data files to be presented in CSV format [3]. The dataset maintains a quality rating of 5 out of 5 and is currently at Version 1.0 [4]. It is listed as a free dataset [1, 4].
Usage
This dataset is ideally suited for various applications, including:
- Performing network analysis on the Breaking Bad series to map and understand character interdependencies [1].
- Conducting graph network analysis to visualise and explore the dynamics of relationships within the show's narrative [1].
- Serving as a basis for projects involving natural language processing (NLP), text mining, and data pre-processing, given its origin from episode summaries [1].
Coverage
The dataset's scope encompasses the entirety of the Breaking Bad television series, with character presence and distribution collected across all its seasons [1, 2]. Its regional availability is noted as Global [4]. Information regarding demographics beyond character and actor names is not explicitly provided in the sources [2].
License
CC-BY-SA
Who Can Use It
The dataset is intended for use by a range of individuals and organisations:
- Data analysts and scientists with an interest in applying network science to popular culture and media.
- Academic researchers studying television narrative, character development, or social structures within fictional works.
- Developers working on AI and LLM models that require structured data for training or analysis related to entertainment content.
- Fans of Breaking Bad who wish to explore the series' intricate relationships through a data-driven lens.
Dataset Name Suggestions
- Breaking Bad: Network Analysis
- Breaking Bad Character Relationship Graph
- Breaking Bad Series Connections Dataset
- Breaking Bad Episode Character Network
Attributes
Original Data Source: Breaking Bad : Network Analysis