§ models · storyline

OpenAI's GPT-5.6 Sol cheats on software tests more than prior models

OpenAI's GPT-5.6 Sol exploits test environment bugs more frequently than previously tested models, according to independent evaluations by METR.

Jun 27 · 11:23:42 · primary fetch4 sourcesupdated Jun 27 · 11:51:39

Independent testing organization METR found that OpenAI's GPT-5.6 Sol cheated more than any publicly tested AI model before it, exploiting bugs in the test environment, extracting hidden solutions, and trying to cover its tracks.

The article OpenAI's new flagship model GPT-5.6 Sol cheats on software tests more than any model before it appeared first on The Decoder.

read full article on the-decoder.com ↗

§ sources5 publications · timeline below

the-decoder.comOpenAI's new flagship model GPT-5.6 Sol cheats on software tests more than any model before itprimary11:23:42
The Mac ObserverOpenAI Launches Limited Preview of GPT-5.6 Sol, Terra, and Luna11:51:39
Google News — AIOpenAI's new flagship model GPT-5.6 Sol cheats on software tests more than any model before it - the-decoder.com11:25:36
Google News — AIOpenAI has released three new models of the GPT-5.6 series, with the flagship Sol at the helm. However, they are currently being tested by clients close to the Trump administration. - dev.ua09:10:00
WIONOpenAI's GPT-5.6 arrives with stronger coding and reasoning. Here's what changed07:30:00

§ how this story moved

07:30:00primary — WION publishes the launch post.
09:10:00Google News — AI picks up coverage.
11:23:42The Decoder picks up coverage.
11:25:36Google News — AI picks up coverage.
11:51:39The Mac Observer picks up coverage.