Daily Word Puzzle Lexicon
Data Science and Analytics
Free
About
This dataset contains the official lexicon of valid words used in the popular daily word puzzle game, Wordle. The game's objective is to guess a hidden five-letter word within six attempts. After each guess, the game provides feedback by changing the colour of tiles to show how close your guess was to the correct word. This dataset was created by inspecting the game's page source and includes two distinct lists: one for all valid words that can be submitted as guesses, and another for all valid words that can be solutions. It offers a valuable resource for analysing Wordle's mechanics and exploring the derivation of optimal strategies.
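The tile feedback described above can be sketched in a few lines. This is a minimal illustration, not the game's actual code: the function name `feedback`, the `G`/`Y`/`-` encoding (green, yellow, grey), and the handling of repeated letters follow the commonly described Wordle rules and are assumptions here.

```python
from collections import Counter


def feedback(guess: str, solution: str) -> str:
    """Return a pattern string: G = right letter and place,
    Y = right letter but wrong place, - = letter not available.

    Assumes both words have the same length (five letters in Wordle).
    """
    result = ["-"] * len(guess)
    # Count only the solution letters that were NOT matched exactly,
    # so a repeated guess letter cannot earn more yellows than exist.
    remaining = Counter(s for g, s in zip(guess, solution) if g != s)
    for i, (g, s) in enumerate(zip(guess, solution)):
        if g == s:
            result[i] = "G"
        elif remaining[g] > 0:
            result[i] = "Y"
            remaining[g] -= 1
    return "".join(result)
```

For instance, guessing `aahed` against a hypothetical solution `abide` marks the first `a` green, the second `a` grey (the solution has only one `a`), and both `e` and `d` yellow.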
Columns
The primary data file, valid_guesses.csv, consists of a single column:
word: This column lists the valid words permitted as guesses in the Wordle game. It contains 10,657 unique values, with no mismatched or missing entries. Because every value is unique, each word appears exactly once; 'aahed', which is reported as the most frequently occurring word, is therefore no more common than any other entry.
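The column description above can be checked directly once the file is downloaded. The sketch below assumes that valid_guesses.csv has a header row naming the column `word`; the helper names `load_words` and `check_lexicon` are illustrative and not part of the dataset.

```python
import csv


def load_words(path: str, column: str = "word") -> list[str]:
    """Read one column of a CSV file into a list of lowercase words.

    Assumes the file starts with a header row containing `column`.
    """
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        return [row[column].strip().lower() for row in reader]


def check_lexicon(words: list[str]) -> dict:
    """Summarise the properties the listing claims for the word list."""
    return {
        "count": len(words),
        "unique": len(set(words)),
        "all_five_letters": all(len(w) == 5 and w.isalpha() for w in words),
    }


# Example usage (requires the downloaded file in the working directory):
# words = load_words("valid_guesses.csv")
# print(check_lexicon(words))
```

If the listing is accurate, `count` and `unique` should both equal 10,657 and `all_five_letters` should be true.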
Distribution
The dataset is typically provided in CSV format. The valid_guesses.csv file has a size of 74.61 kB and contains approximately 10.7 thousand records (words). A sample file will be uploaded separately to the platform.
Usage
This dataset is ideal for various applications, including:
- Developing optimal strategies for playing Wordle by understanding the permissible guesses and solutions.
- Creating AI or algorithmic solvers for the Wordle game.
- Analysing linguistic patterns and common word structures within the Wordle lexicon.
- Educational purposes in exploring dictionaries and data structures.
- Developing similar word puzzle games or enhancing existing ones.
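As one possible starting point for the strategy and linguistic-pattern uses listed above, the sketch below counts how often each letter appears in each position and ranks words by that count. It is a simple heuristic chosen for illustration, not an optimal strategy; the function names and the four-word sample list are hypothetical.

```python
from collections import Counter


def positional_counts(words: list[str]) -> list[Counter]:
    """Count letter occurrences separately for each of the five positions."""
    counts = [Counter() for _ in range(5)]
    for word in words:
        for i, letter in enumerate(word):
            counts[i][letter] += 1
    return counts


def score(word: str, counts: list[Counter]) -> int:
    """Sum positional counts, crediting each distinct letter only once
    so that words with repeated letters are not over-rewarded."""
    seen = set()
    total = 0
    for i, letter in enumerate(word):
        if letter not in seen:
            total += counts[i][letter]
            seen.add(letter)
    return total


def best_opener(words: list[str]) -> str:
    """Return the word whose letters are most common in their positions."""
    counts = positional_counts(words)
    return max(words, key=lambda w: score(w, counts))


# Tiny illustrative sample; in practice, pass the full word list.
sample = ["crane", "crate", "slate", "aahed"]
print(best_opener(sample))
```

A stronger solver would typically weigh how well a guess splits the remaining candidate solutions, but positional frequency is a cheap first approximation.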
Coverage
The dataset's scope is strictly confined to the lexicon of the Wordle game, as derived from its internal source. It does not specify geographic or demographic coverage. It represents a static snapshot of the words available within the game at the time of extraction; no updates are expected.
License
CC0: Public Domain
Who Can Use It
This dataset is particularly useful for:
- Data scientists and analysts: For exploring word frequencies and creating data-driven game strategies.
- Game developers: Interested in the mechanics and word lists of successful puzzle games.
- Puzzle enthusiasts: Seeking to understand the underlying structure of Wordle and improve their gameplay.
- Researchers: In fields such as natural language processing or artificial intelligence, for text analysis and strategy development.
Dataset Name Suggestions
- Wordle Valid Words
- Wordle Lexicon Dataset
- Wordle Game Dictionary
- Daily Word Puzzle Lexicon
- Wordle Guesses and Solutions
Attributes
Original Data Source: Daily Word Puzzle Lexicon