§ feed · storyline

Researcher creates quirky benchmark for ChatGPT Images 2.0

Researcher publishes a informal image-generation benchmark testing ChatGPT Images 2.0 using a Where's Waldo-style prompt featuring a raccoon holding a ham radio.

Apr 21 · 22:36:09 · primary fetch1 sourceupdated Apr 21 · 22:36:09

I came up with a somewhat foolish new benchmark for testing image generation models, to exercise the new ChatGPT Images 2.0: "Do a where's Waldo style image but it's where is the raccoon holding a ham radio" simonwillison.net/2026/Apr/21/...

read full article on bsky.app ↗

§ sources1 publication · timeline below

bsky.appI came up with a somewhat foolish new benchmark for testing image generation models, to exercise the new ChatGPT Images 2.0:primary22:36:09