DC Superheroes and Supervillains Variant Data
Product Reviews & Feedback
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
Information concerning DC Universe superheroes and supervillains across various universes. This product compiles general character data, attempting to capture all existing variants of figures within the DC Multiverse. It provides a detailed look at affiliations, identities, creators, and origins for a vast collection of heroes and antagonists. Scraped from DC Fandom, the data set offers more than 30,000 unique entries and is expected to receive annual updates.
Columns
- PageID: The identifying number for the character entry within the wiki.
- Name: The original or primary name of the character associated with that specific universe.
- Universe: Specifies the variant of the character, noting which specific DC Universe they belong to (e.g., New Earth, Prime Earth).
- URL: A unique link directing users to the character’s specific wiki page.
- Identity: Indicates whether the character possesses a secret or public identity.
- Gender: Details the character's gender, with the majority listed as Male (70%).
- Marital Status: Shows whether the character variant is married or not, although this field has a substantial percentage of missing data (48%).
- Teams: Lists the groups or teams to which the variant is affiliated, either currently or historically, within their respective universe.
- Weight (in kg): The weight of the character measured in kilograms. Note that this field has significant missing values, with approximately 92% of entries being null.
- Creators: Provides the names of the artists or writers responsible for creating the character, with 'Bill Finger; Bob Kane' being a notable entry.
Distribution
The data product is available as a CSV file (dc_characters_dataset.csv) and contains approximately 31.5 thousand valid rows of character information, structured across 11 columns. Key metadata indicates that the PageID and URL fields contain 31,465 unique values, matching the total record count. Data quality varies; while fields like Name and Universe are generally well-populated, Weight (in kg) and Marital Status have high proportions of missing information.
Usage
The data is ideal for analysis projects focused on character distribution across universes, statistical comparisons of hero and villain attributes, tracking team affiliations, and studying character naming conventions and identity prevalence. It is highly suitable for building applications, visualisations, or machine learning models centered on popular culture and the DC Multiverse.
Coverage
The scope includes general information covering all superheroes and supervillains belonging to the DC Universe. Data availability spans different variants of characters existing within the Multiverse, noting the specific Universe for each entry. The data represents information as scraped from the DC Fandom wiki.
License
CC0: Public Domain
Who Can Use It
- Academics and Researchers: To conduct sociological studies on gender representation or identity concepts within comic book narratives.
- Data Analysts: To explore the relationship between creators, character variants, and universe timelines.
- Game Developers: To populate character databases for fan-made games or quizzes based on DC lore.
- Writers and Journalists: To draw statistical insights and supporting data for articles on the history and evolution of DC characters.
Dataset Name Suggestions
- DC Multiverse Character Details
- DC Superheroes and Supervillains Variant Data
- DC Fandom Character Repository
- DC Comics Universe Information.
Attributes
Original Data Source: DC Superheroes and Supervillains Variant Data