Opendatabay APP

Daily Word Puzzle Lexicon

Data Science and Analytics

Tags and Keywords

Wordle

Puzzles

Words

Games

Lexicon

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Daily Word Puzzle Lexicon Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset contains the official lexicon of valid words used in the popular daily word puzzle game, Wordle. The game's objective is to guess a hidden five-letter word within six attempts. After each guess, the game provides feedback by changing the colour of tiles to show how close your guess was to the correct word. This dataset was created by inspecting the game's page source and includes two distinct lists: one for all valid words that can be submitted as guesses, and another for all valid words that can be solutions. It offers a valuable resource for analysing Wordle's mechanics and exploring the derivation of optimal strategies.

Columns

The primary data file, valid_guesses.csv, consists of a single column:
  • word: This column lists valid words permitted as guesses in the Wordle game. It contains 10,657 unique values, indicating a 100% valid and unique set of entries with no mismatched or missing data. The most frequently occurring word observed in this list is 'aahed'.

Distribution

The dataset is typically provided in CSV format. The valid_guesses.csv file has a size of 74.61 kB and contains approximately 10.7 thousand records (words). A sample file will be updated separately to the platform.

Usage

This dataset is ideal for various applications, including:
  • Developing optimal strategies for playing Wordle by understanding the permissible guesses and solutions.
  • Creating AI or algorithmic solvers for the Wordle game.
  • Analysing linguistic patterns and common word structures within the Wordle lexicon.
  • Educational purposes in exploring dictionaries and data structures.
  • Developing similar word puzzle games or enhancing existing ones.

Coverage

The dataset's scope is strictly confined to the lexicon of the Wordle game, as derived from its internal source. It does not specify geographic or demographic coverage. It represents a static snapshot of the words available within the game, as there are no expected updates.

License

CC0: Public Domain

Who Can Use It

This dataset is particularly useful for:
  • Data scientists and analysts: For exploring word frequencies and creating data-driven game strategies.
  • Game developers: Interested in the mechanics and word lists of successful puzzle games.
  • Puzzle enthusiasts: Seeking to understand the underlying structure of Wordle and improve their gameplay.
  • Researchers: In fields such as natural language processing or artificial intelligence, for text analysis and strategy development.

Dataset Name Suggestions

  • Wordle Valid Words
  • Wordle Lexicon Dataset
  • Wordle Game Dictionary
  • Daily Word Puzzle Lexicon
  • Wordle Guesses and Solutions

Attributes

Original Data Source: Daily Word Puzzle Lexicon

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

13/08/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in ZIP Format