§ feed · storyline
Researcher creates quirky benchmark for ChatGPT Images 2.0
Researcher publishes a informal image-generation benchmark testing ChatGPT Images 2.0 using a Where's Waldo-style prompt featuring a raccoon holding a ham radio.
I came up with a somewhat foolish new benchmark for testing image generation models, to exercise the new ChatGPT Images 2.0: "Do a where's Waldo style image but it's where is the raccoon holding a ham radio" simonwillison.net/2026/Apr/21/...
§ sources1 publication · timeline below