Optimizing Wordle guesses

Published: Jan 17, 2022 by Steve Baskauf

I’ve recently started playing the viral word game “Wordle”. After numerous discussions among family members about guessing strategy, I decided to spend some time over the holiday weekend writing a Python script to play with different guessing algorithms and strategies.

The details are in a blog post, but if you don’t like reading text and just want to start playing with the code, you can go straight to this Colab notebook to run and hack the code yourself.

The TLDR summary of the blog post is:

  1. I generated a list of five-letter English words of a similar size to what is used in the game.
  2. I created a Wordle_list object that screens word lists based on “guess codes” (an alphanumeric version of the colored squares that the app shows you after you guess).
  3. I messed around with different ways to use letter frequencies in the words to assign scores to words in a list based on estimating how effective the words would be at screening out the other words.
  4. I wrote some code to generate the most effective words to use as automatic first two guesses (spoiler alert). They are “arose” and “glint”.
  5. I used existing Wordle game words to test whether it was more effective to just use these two words, or to let the scoring system chose the second word. It didn’t make much difference.

Enjoy!

Share

Latest Posts

Enabling ecological survey data integration
Enabling ecological survey data integration

Our paper describing the Humboldt Extension to Darwin Core has been published in Ecography. The Humboldt Extension adds 55 terms that enrich the Darwin Core, providing the terms needed to capture and share multiple types of biodiversity survey data. The paper illustrates the benefits of implementing the Humboldt Extension with three case studies and demonstrates how richer data can be used in research, modelling, and to inform decision-making.

Biological survey and monitoring data publishing guide
Biological survey and monitoring data publishing guide

My coauthors and I have published a guide to help people understand how to use the new Humboldt Extension for Biological Inventories of the Darwin Core standard. The guide includes diagrams and detailed information about how to structure the data to capture the hierarchical structure typically found in monitoring projects.

Open Science recipes published
Open Science recipes published

My colleage from the Vanderbilt Libraries’ Digital Lab, Shenmeng Xu, an I have published two chapters in the ACRL’s 2025 Open Science Cookbook. The Cookbook is a lighthearted take on technical topics where instructions are given in “recipe” format to teach beginners new tech skills.