National Pokédex Dataset
Entertainment & Media Consumption
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset provides detailed information for Pokémon from 1 to 1045, as listed in the National Pokédex. It includes fundamental Pokédex entries such as their names, types, and physical attributes, alongside more in-depth data like move sets, type effectiveness, abilities with full descriptions, and battle strategies sourced from Smogon. Additionally, the dataset contains brief descriptions from Bulbapedia. A distinct text corpus file is also included, offering a textual representation for each Pokémon, compiled from all the details present in the main Pokédex file.
Columns
The main Pokémon file features 56 columns, providing extensive details for each creature. Key columns include:
- pokédex number: The official National Pokédex identification number.
- name: The English name of the Pokémon.
- japanese name: The Japanese name of the Pokémon.
- generation: The generation number the Pokémon originates from.
- status: Indicates if the Pokémon is Legendary.
- species: The specific species of the Pokémon.
- type number: How many elemental types the Pokémon possesses.
- type 1: The primary elemental type.
- type 2: The secondary elemental type, if applicable.
- height: The Pokémon's height in metres.
- weight: The Pokémon's weight in kilograms.
- abilities number: The count of abilities it can have.
- total points: The sum of all base stats.
- stats: Individual columns for key battle statistics: HP, attack, defence, special attack, special defence, and speed.
- catch rate: The Pokémon's catch rate.
- base friendship: The base friendship value.
- base experience: The base experience yield.
- growth rate: The growth rate category.
- egg type number: The number of egg groups it belongs to.
- egg type 1: The primary egg group.
- egg type 2: The secondary egg group, if applicable.
- percentage male: The likelihood of the Pokémon being male.
- egg cycles: The number of steps required to hatch an egg.
- type effectiveness: Columns detailing effectiveness against various types (e.g., normal, fire, water, grass, electric, flying, ground, rock, fighting, psychic, dark, ghost, dragon, ice, fairy, poison, bug, steel).
- Smogon description: Battle strategies primarily from SM Pokédex, or other generations if more relevant.
- Bulba description: Initial sentences from the Pokémon's Bulbapedia page.
- moves: A dictionary detailing moves the Pokémon learns by levelling up, including name, type, damage type, power, accuracy, PP, level learned, secondary effect chance, and description.
- ability 1, ability 2, hidden ability: The names of the Pokémon's abilities.
- ability 1 description, ability 2 description, hidden ability description: Descriptions for each of the Pokémon's abilities.
The accompanying Poké corpus file contains a text corpus for each Pokémon, generated by consolidating all the information from the Pokédex file.
Distribution
This dataset encompasses information for Pokémon numbered 1 through 1045. The primary Pokémon data file contains 56 distinct columns for each entry. While specific row counts are not provided, there are 1045 unique Pokémon entries detailed. Data files are typically provided in CSV format.
Usage
This dataset is ideally suited for a variety of applications, particularly in the fields of artificial intelligence, machine learning, and data analysis related to gaming and entertainment.
- Building AI Chatbots: Useful for creating conversational agents, such as a Pokémon chatbot, through retrieval-augmented generation (RAG) pipelines.
- Game Development: Provides extensive data for developers creating Pokémon-inspired games or applications.
- Data Analysis: Researchers and enthusiasts can analyse Pokémon stats, moves, and abilities for competitive strategy or general insights.
- Natural Language Processing (NLP): The text corpus can be used for text generation, entity recognition, and other NLP tasks related to Pokémon lore.
Coverage
The dataset covers Pokémon from number 1 to 1045 in the National Pokédex. Its scope is global, providing information relevant to all regions where Pokémon are known. There are no specific notes on data availability for certain groups or years beyond the stated Pokédex range.
License
CC BY-SA
Who Can Use It
- Data Scientists and AI/ML Developers: For training models, building recommendation systems, or developing chatbots and other AI applications using the detailed Pokémon attributes and text corpus.
- Game Developers: To integrate accurate and detailed Pokémon information into their projects.
- Researchers: For academic studies on game design, character attributes, or data structures in entertainment.
- Pokémon Enthusiasts and Community Developers: For fan-made applications, wikis, or statistical analyses of the Pokémon universe.
Dataset Name Suggestions
- Master Pokémon Data & Corpus
- National Pokédex Dataset
- Pokémon Battle Statistics Compendium
- Ultimate Pokémon Data Collection
- Pokémon Info & Text Corpus
Attributes
Original Data Source: Master Pokemon Dataset and Corpus