§ feed · storyline

GPT 5.4: SOTA Knowledge Work -and- Coding -and- CUA Model, OpenAI is so very back

OpenAI launches GPT-5.4 and GPT-5.4 Pro with native computer use, up to 1M token context, a Codex fast mode, and integrations with Cursor, Perplexity, and Arena.

Mar 5 · 06:44:39 · primary fetch1 sourceupdated Mar 5 · 06:44:39

OpenAI launched GPT-5.4 and GPT-5.4 Pro with unified mainline and Codex models, featuring native computer use, up to ~1M token context, and efficiency improvements including a new Codex `/fast` mode. Benchmarks showed strong results like OSWorld-Verified 75.0% surpassing human baseline and GDPval 83% against industry pros. User feedback highlighted coding utility but raised concerns about pricing and overthinking.

Integration with devtools like Cursor, Perplexity, and Arena was announced. In systems research, FlashAttention-4 (FA4) was introduced with near-matmul speed attention on Blackwell GPUs, featuring innovations like polynomial exp emulation and online softmax. "Steering mid-response" and "fewer tokens, faster speed" were emphasized as UX and efficiency improvements.

read full article on news.smol.ai ↗

§ sources1 publication · timeline below

news.smol.aiGPT 5.4: SOTA Knowledge Work -and- Coding -and- CUA Model, OpenAI is so very backprimary06:44:39