Zvi Mowshowitz / Don't Worry About the Vase: GPT-5.6 system card indicates Sol is well below the level of most worrisome Mythos use cases, suggesting all GPT-5.6 versions could be released without delay — While…
360 founder Zhou Hongyi presents two AI security tools designed to compete with Anthropic's Mythos. One has already flagged 3,432 vulnerabilities. Zhou admits Chinese models trail Western ones by 20 to 30 percent, but…
via the-decoder.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
Independent testing organization METR found that OpenAI's GPT-5.6 Sol cheated more than any publicly tested AI model before it, exploiting bugs in the test environment, extracting hidden solutions, and trying to cover…
Epoch AI's new MirrorCode benchmark tests whether AI models can recreate complete programs without access to the original code. Claude Opus 4.7 leads with a 56 percent solve rate, rebuilding a 16,000-line toolkit in…
via the-decoder.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
Thursday, June 25, 2026’s editionThursday, June 25, 2026
Researchers introduced DFlash, a speculative decoding model that drafts entire token blocks in parallel, achieving up to 15x throughput on NVIDIA Blackwell GPUs.
via marktechpost.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
Zhipu AI's GLM-5.2 nearly matches Claude Opus 4.7 in a Snowflake benchmark with 103 coding tasks at one-fifth the cost per output token. But the Chinese model burns through nearly twice as many tokens per task. Still…
via the-decoder.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
Thomas Claburn / The Register: Nature publishes a peer-reviewed paper alleging that Microsoft's 2025 quantum breakthrough claims were based on “basic Python errors” and data cherry-picking — Nature…
via theregister.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
13:41:02Google News — AI Products & Releases+8 sources
OpenAI researchers show that reinforcement learning on desired behavioral traits like truthfulness and corrigibility works across domains. Training on health data also improved deception detection, and the model scored…
via the-decoder.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
Thursday, June 18, 2026’s editionThursday, June 18, 2026
Anthropic's security team found that its Mythos Preview AI model can turn security patches for Firefox and the Windows kernel into working exploits within hours, for a few thousand dollars and no specialized knowledge…
via the-decoder.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
Microsoft Research presents Lens, a text-to-image model with just 3.8 billion parameters that matches much larger rivals on benchmarks, at a fraction of the training cost. The secret sauce: 800 million detailed image…
via the-decoder.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
Sakana AI has launched a dedicated research lab for recursive self-improvement: AI that iteratively improves itself. The Japanese startup, co-founded by Transformer co-author Llion Jones, sees RSI as an alternative to…
Anthropic: Anthropic details its progress toward recursive self-improvement, and its implications, and says 80%+ of the code merged into its codebase is authored by Claude — Our progress toward recursive…
via anthropic.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
New research studies demonstrate that model capabilities interact and cooperate significantly above a 3.5B parameter threshold, while also identifying fundamental cognitive blind spots in state-of-the-art models like…
via reddit.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
New NVIDIA Research breakthroughs show how training at scale — across gripper types, driving scenarios and virtual worlds — creates AI that generalizes to diverse applications.
via blogs.nvidia.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
Cade Metz / New York Times: University of Toronto researchers claim to have developed a “worm” powered by open source AI that exploits known flaws and tailors attacks for each computer — Researchers…
via nytimes.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
Tuesday, June 2, 2026’s editionTuesday, June 2, 2026