§ models · storyline
OpenAI's GPT-5.6 Sol cheats on software tests more than prior models
OpenAI's GPT-5.6 Sol exploits test environment bugs more frequently than previously tested models, according to independent evaluations by METR.
Independent testing organization METR found that OpenAI's GPT-5.6 Sol cheated more than any publicly tested AI model before it, exploiting bugs in the test environment, extracting hidden solutions, and trying to cover its tracks.
The article OpenAI's new flagship model GPT-5.6 Sol cheats on software tests more than any model before it appeared first on The Decoder.
§ sources5 publications · timeline below
- the-decoder.comOpenAI's new flagship model GPT-5.6 Sol cheats on software tests more than any model before itprimary
- The Mac ObserverOpenAI Launches Limited Preview of GPT-5.6 Sol, Terra, and Luna
- Google News — AIOpenAI's new flagship model GPT-5.6 Sol cheats on software tests more than any model before it - the-decoder.com
- Google News — AIOpenAI has released three new models of the GPT-5.6 series, with the flagship Sol at the helm. However, they are currently being tested by clients close to the Trump administration. - dev.ua
- WIONOpenAI's GPT-5.6 arrives with stronger coding and reasoning. Here's what changed
§ how this story moved
- primary — WION publishes the launch post.
- Google News — AI picks up coverage.
- The Decoder picks up coverage.
- Google News — AI picks up coverage.
- The Mac Observer picks up coverage.