Master Pokemon Dataset and Corpus
About
This dataset contains information for Pokémon 1-1045 in the National Pokédex. It contains basic Pokédex information such as name, type, height, etc., move sets with move information, type effectiveness, abilities with descriptions, stats, brief Bulbapedia descriptions, and battle strategies from Smogon.
The Pokémon file contains 56 columns describing each Pokémon:
pokédex number: the number of the Pokémon as listed in the National Pokédex
name: name of Pokémon
japanese name: Japanese name of Pokémon
generation: generation Pokémon comes from
status: whether the Pokémon is Legendary
species: species of Pokémon
type number: how many types it has
type 1: 1st type of Pokémon
type 2: 2nd type of Pokémon
height: height of Pokémon in meters
weight: weight of Pokémon in kilograms
abilities number: number of abilities it has
total points: the total that the Pokémon's stats add up to
stats: one column for each of the Pokémon's stats: HP, attack, defense, sp. attack, sp. defense, speed
catch rate: Pokémon's catch rate
base friendship: base friendship of Pokémon
base experience: base experience of Pokémon
growth rate: growth rate of Pokémon
egg type number: how many egg groups it is a part of
egg type 1: 1st egg group
egg type 2: 2nd egg group
percentage male: percentage chance that the Pokémon is male
egg cycles: number of cycles (255-257 steps) required to hatch an egg of the Pokémon
type effectiveness: one column per type, giving the Pokémon's type effectiveness against normal, fire, water, grass, electric, flying, ground, rock, fighting, psychic, dark, ghost, dragon, ice, fairy, poison, bug, and steel
Smogon description: battle strategies from Smogon, taken primarily from the SM Pokédex. Where no SM entry exists, the strategy comes from SV, SS, XY, or BW, depending on which generation has a strategy entry for the Pokémon
Bulba description: first few sentences on the Bulbapedia page for each Pokémon
moves: dictionary of the moves the Pokémon can learn by leveling up. Each move includes its name, type, damage type, power, accuracy, PP, level learned, chance of its secondary effect occurring, and description
ability 1: first ability the Pokémon can have
ability 2: second ability the Pokémon can have
hidden ability: Pokémon's hidden ability
ability 1 description: description of ability 1 of Pokémon
ability 2 description: description of ability 2 of Pokémon
hidden ability description: description of hidden ability of Pokémon
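The column layout above can be illustrated with a small loading sketch. It is not taken from the dataset's own code: the column names, the one-row sample, and the shape of the moves cell (a Python-literal dictionary keyed by move name) are all assumptions made for illustration, so check the real file's header before reusing any of it.

```python
import ast
import csv
import io

# Illustrative stand-in for one row of the Pokémon file. The column names and
# the layout of the "moves" cell are assumptions, not the file's actual schema.
SAMPLE_CSV = '''pokedex_number,name,total_points,hp,attack,defense,sp_attack,sp_defense,speed,moves
1,Bulbasaur,318,45,49,49,65,65,45,"{'Tackle': {'type': 'Normal', 'power': 40, 'level': 1}, 'Vine Whip': {'type': 'Grass', 'power': 45, 'level': 3}}"
'''

STAT_COLUMNS = ["hp", "attack", "defense", "sp_attack", "sp_defense", "speed"]


def load_rows(text):
    """Parse CSV text into dicts, decoding the moves cell into a real dict."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        # literal_eval parses a Python-literal string without executing code.
        row["moves"] = ast.literal_eval(row["moves"])
        rows.append(row)
    return rows


def stats_match_total(row):
    """Check that the six stat columns add up to the total points column."""
    return sum(int(row[c]) for c in STAT_COLUMNS) == int(row["total_points"])


def moves_learned_by(row, level):
    """Return the names of moves learned at or below the given level, sorted."""
    return sorted(
        name for name, info in row["moves"].items() if info["level"] <= level
    )


rows = load_rows(SAMPLE_CSV)
bulbasaur = rows[0]
print(stats_match_total(bulbasaur))
print(moves_learned_by(bulbasaur, 1))
```

If the real moves column is stored as JSON rather than as a Python literal, json.loads would be the natural replacement for ast.literal_eval.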
The Poké corpus file contains:
A text corpus for each Pokémon created from all the information present in the Pokédex file.
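As a rough idea of how a per-Pokémon text entry might be assembled from tabular fields, here is a minimal sketch. The function build_corpus_entry, the field names, and the sentence templates are hypothetical; the actual corpus in this dataset may be worded and structured differently.

```python
def build_corpus_entry(row):
    """Flatten one Pokémon record (a dict) into a single plain-text passage."""
    types = " / ".join(t for t in (row.get("type_1"), row.get("type_2")) if t)
    parts = [
        f"{row['name']} is a {types}-type Pokémon from generation {row['generation']}.",
        f"It is {row['height_m']} m tall and weighs {row['weight_kg']} kg.",
    ]
    abilities = [
        row.get(key)
        for key in ("ability_1", "ability_2", "hidden_ability")
        if row.get(key)
    ]
    if abilities:
        parts.append("Abilities: " + ", ".join(abilities) + ".")
    # Free-text fields are appended verbatim when present.
    for key in ("bulba_description", "smogon_description"):
        if row.get(key):
            parts.append(row[key])
    return " ".join(parts)


# Hypothetical record; the keys are illustrative, not the file's column names.
sample = {
    "name": "Bulbasaur", "generation": 1,
    "type_1": "Grass", "type_2": "Poison",
    "height_m": 0.7, "weight_kg": 6.9,
    "ability_1": "Overgrow", "ability_2": None, "hidden_ability": "Chlorophyll",
}
entry = build_corpus_entry(sample)
print(entry)
```

Keeping each entry as one self-contained passage makes it straightforward to treat a Pokémon as a single document when indexing the corpus for search.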
GitHub & Uses
Medium Article:
Talk to your Data with Retrieval-Augmented Generation (RAG) or: How I Built a Pokémon Chatbot
GitHub:
To see the Jupyter Notebook in which this data was collected and combined:
GitHub Repo
How I used the data in a retrieval-augmented generation (RAG) pipeline:
GitHub Repo (or the code attached to this dataset)
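For readers unfamiliar with RAG, the retrieval step can be sketched in a few lines. This is a simplified stand-in, not the pipeline from the linked repository: it ranks corpus entries by bag-of-words cosine similarity, whereas a real pipeline would more likely use an embedding model and a vector store. The three corpus entries and the names retrieve and build_prompt are made up for the example.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into ASCII alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in set(a) & set(b))
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(query, corpus, k=1):
    """Return the names of the k corpus entries most similar to the query."""
    q = Counter(tokenize(query))
    ranked = sorted(
        corpus.items(),
        key=lambda item: cosine(q, Counter(tokenize(item[1]))),
        reverse=True,
    )
    return [name for name, _ in ranked[:k]]


def build_prompt(query, corpus, k=1):
    """Prepend the retrieved entries to the question as context for an LLM."""
    context = "\n".join(corpus[name] for name in retrieve(query, corpus, k))
    return f"Context:\n{context}\n\nQuestion: {query}"


# Hypothetical mini-corpus standing in for the Poké corpus file.
corpus = {
    "Bulbasaur": "Bulbasaur is a Grass / Poison-type Pokémon. Abilities: Overgrow, Chlorophyll.",
    "Charmander": "Charmander is a Fire-type Pokémon. Abilities: Blaze, Solar Power.",
    "Squirtle": "Squirtle is a Water-type Pokémon. Abilities: Torrent, Rain Dish.",
}
question = "Which starter is a fire type?"
print(retrieve(question, corpus))
print(build_prompt(question, corpus))
```

The prompt returned by build_prompt would then be passed to a language model, which answers using the retrieved context rather than its own memory alone.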
Acknowledgements
This dataset was made by web scraping Bulbapedia and Smogon, combined with data already compiled in the following datasets:
Mario Tormo Romero's Pokémon dataset here on Kaggle
Cayo Costa Sobral's Pokémon dataset here on Kaggle
And their respective contributors.
License
CC BY-SA
Original Data Source: Master Pokemon Dataset and Corpus