Opendatabay APP

Master Pokemon Dataset and Corpus

Entertainment & Media Consumption

Tags and Keywords

Earth and Nature

Arts and Entertainment

Movies and TV Shows

Video Games

Games

Anime and Manga

NLP

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Master Pokemon Dataset and Corpus Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset contains information for Pokémon 1-1045 in the National Pokédex. It contains basic Pokédex information such as name, type, height, etc., move sets with move information, type effectiveness, abilities with descriptions, stats, brief Bulbapedia descriptions, and battle strategies from Smogon.
The Pokémon file contains 56 columns for each Pokémon's: pokédex number: the number of the Pokémon as listed in the National Pokédex name: name of Pokémon japanese name: Japanese name of Pokémon generation: generation Pokémon comes from status: if the Pokémon is Legendary or not species: species of Pokémon type number: how many types it has type 1: 1st type of Pokémon type 2: 2nd type of Pokémon height: height of Pokémon in meters weight: weight of Pokémon in kilograms abilities number: number of abilities it has total points: amount total stats add up to stats: column for each stat of Pokémon HP attack defense sp. attack sp. defense speed catch rate: Pokémon's catch rate base friendship: base friendship of Pokémon base experience: base experience of Pokémon growth rate: growth rate of Pokémon egg type number: how many egg groups it is apart of egg type 1: 1st egg group egg type 2: 2nd egg group percentage male: chance Pokémon is a male egg cycles: number of cycles (255-257 steps) required to hatch an egg of the Pokémon type effectiveness: column for each of Pokémon's type effectiveness against: normal fire water grass electric flying ground rock fighting psychic dark ghost dragon ice fairy poison bug steel Smogon description: battle strategies from Smogon. Primarily from SM Pokédex. If not applicable, then from either SV, SS, XY, BW (depending on which generation had a strategy entry for the Pokémon) Bulba description: first few sentences on the Bulbapedia page for each Pokémon moves: dictionary of moves the Pokémon is able to learn by leveling up. Includes move: name type damage type power accuracy PP level learned secondary effect chance of occurrence description ability 1: ability Pokémon is able to have ability 2: ability Pokémon is able to have hidden ability: Pokémon's hidden ability ability 1 description: description of ability 1 of Pokémon ability 2 description: description of ability 2 of Pokémon hidden ability description: description of hidden ability of Pokémon The Poké corpus file contains: A text corpus for each Pokémon created from all the information present in the Pokédex file.
Github & Uses Medium Article:
Talk to your Data with Retrieval-Augmented Generation (RAG) or: How I Built a Pokémon Chatbot
Github:
To see the Jupyter Notebook in which this data was collected and combined: GitHub Repo
How I used the data in a retrieval-augmented generation (RAG) pipeline: GitHub Repo (or the attached code to this dataset)
Acknowledgements This dataset was made by web scraping Bulbapedia, Smogon, and with data already compiled from the following datasets:
Mario Tormo Romero's Pokémon dataset here on Kaggle
Cayo Costa Sobral's Pokémon dataset here on Kaggle
And their respective contributors.

License

CC BY-SA
Original Data Source: Master Pokemon Dataset and Corpus

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

16/06/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free