Superheroes and Villains Variant Data
Data Science and Analytics
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This resource offers detailed general information on Marvel superheroes and supervillains. The dataset aims to provide extensive coverage of character variants existing across the Marvel Multiverse, built through scraping data from Marvel Fandom. Containing over 90,000 entries, it provides substantial insight into character identities, affiliations, and core attributes across various universes.
Columns
- PageID: Identifier of the character as listed in the wiki.
- Name: The main or original name of the character within that specific Universe.
- Universe: Specifies the variant of the character by their resident Universe (e.g., Earth-616).
- URL: The unique link directing to the character's wiki page.
- Identity: Records whether the character maintains a secret or public identity.
- Gender: Notes the character's stated gender.
- Alive: Classifies whether the character is alive or deceased in that particular Universe.
- Marital Status: Indicates whether the character variant is married or not.
- Teams: The affiliations (past or present) to which the character belongs in their Universe.
- Height (in m): Height measurement of the character in meters.
- Weight (in kg): Weight measurement of the character in kilograms.
- Origin: Specifies the race of origin for the character.
- Creators: Names of the artists responsible for creating the character.
Distribution
The dataset is typically structured as a CSV file (
marvel_characters_dataset.csv) and has a file size of approximately 17.53 MB. It contains over 92,600 entries distributed across 14 columns. Core columns like PageID and URL contain a very high number of unique values, reflecting the large number of distinct character entries. The most represented Universe is Earth-616 (36% of records). It should be noted that physical measurement columns (Height and Weight) have a high percentage of missing values (94% and 95%, respectively).Usage
This data is highly suitable for statistical analysis of fictional populations, examining character origins, and understanding team dynamics within the Marvel cannon. Ideal applications include creating visualisations of character demographics, building models to analyse the distribution of variants across the Multiverse, or conducting research into popular culture trends relating to superheroes and villains.
Coverage
The scope is purely fictional, covering the multitude of characters, superheroes, and supervillains existing across the Marvel Multiverse, with specific focus on universes such as Earth-616 and Earth-199999. The character records include specific attributes related to identity, gender, and life status. The data collection is expected to be updated annually.
License
CC0: Public Domain
Who Can Use It
- Academics and Researchers: Analysing the creation and evolution of comic book characters and narratives.
- Data Scientists: Performing detailed statistical analysis on character attributes like physical traits and origins.
- Developers and Enthusiasts: Building interactive applications, fan databases, or new informational resources based on Marvel characters.
Dataset Name Suggestions
- Marvel Multiverse Character Archive
- Superheroes and Villains Variant Data
- Fandom Scraped Marvel Character Traits
- Detailed Marvel Universe Demographics
Attributes
Original Data Source: Superheroes and Villains Variant Data
Loading...
