shipfeedAI news, curated daily

17:03:22 CET
29 JUN17:03:22shipfeed
pull to refreshlast sync

Research — shipfeed

About · Research

Papers, evals, SOTA claims, alignment.

Research50 storylines

SponsoredNimbuspaid placement
Featured partner · Agents

Need an agent shipped this quarter?

Nimbus builds production AI systems combining humans and AI end-to-end. From scoped pilot to production in 4 to 8 weeks.

Talk to Nimbus →
Saturday, June 27, 2026’s edition
The Decoder+3 sources
GPT · 4 sources

OpenAI's GPT-5.6 Sol cheats on software tests more than prior models

Independent testing organization METR found that OpenAI's GPT-5.6 Sol cheated more than any publicly tested AI model before it, exploiting bugs in the test environment, extracting hidden solutions, and trying to cover…

via the-decoder.com·+4 sources+4 sourcesthe-decoder.comprimaryThe Mac ObserverGoogle News — AIWION·Click to report a broken or paywalled link. Two distinct reports hide the row.
Friday, June 26, 2026’s edition
The Decoder
EVALS · 1 source

AI model runs nonstop 19 days on $2,600 coding task

Epoch AI's new MirrorCode benchmark tests whether AI models can recreate complete programs without access to the original code. Claude Opus 4.7 leads with a 56 percent solve rate, rebuilding a 16,000-line toolkit in…

via the-decoder.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
Thursday, June 25, 2026’s edition
Wednesday, June 24, 2026’s edition
Theregister
SAFETY · 1 source

Nature paper challenges Microsoft quantum claims over coding errors

Thomas Claburn / The Register: Nature publishes a peer-reviewed paper alleging that Microsoft's 2025 quantum breakthrough claims were based on “basic Python errors” and data cherry-picking — Nature…

via theregister.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
Google News — AI Products & Releases+8 sources
AGENTS · 9 sources

Anthropic model finds vulnerabilities in classified US systems

Anthropic AI Model Identifies Vulnerabilities in Classified U.S. Government Systems During Testing citybiz

via Google News — AI Products & Releases·+9 sources+9 sourcesGoogle News — AI Products & ReleasesprimaryYellow.comYahooBeInCryptoIndexBoxEuronews.comLet's Data ScienceAction News JaxWSOC TV·Click to report a broken or paywalled link. Two distinct reports hide the row.
Tuesday, June 23, 2026’s edition
R&D World
RESEARCH · 1 source

AI chemist improves stubborn coupling reaction

OpenAI and Molecule.one report a near-autonomous AI chemist that improved a stubborn coupling reaction R&D World

via R&D World·Click to report a broken or paywalled link. Two distinct reports hide the row.
Sunday, June 21, 2026’s edition
Friday, June 19, 2026’s edition
Thursday, June 18, 2026’s edition
Tuesday, June 16, 2026’s edition
Google DeepMind — Blog
AGENTS · 1 source

Securing the future of AI agents

Securing internal systems with an AI Control Roadmap, combining traditional safeguards and real-time monitoring.

via deepmind.google·Click to report a broken or paywalled link. Two distinct reports hide the row.
Thursday, June 11, 2026’s edition
Wednesday, June 10, 2026’s edition
Reddit — AI Communities
AI · 1 source

JPMorgan, OQC, and AMD Quantum AI Collaboration

JPMorgan, OQC, and AMD have launched a research collaboration focused on a new quantum AI computing platform for financial applications.

via reddit.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
Tuesday, June 9, 2026’s edition
Monday, June 8, 2026’s edition
The Decoder
RESEARCH · 1 source

Microsoft's Lens model shows detailed captions beat scale in image

Microsoft Research presents Lens, a text-to-image model with just 3.8 billion parameters that matches much larger rivals on benchmarks, at a fraction of the training cost. The secret sauce: 800 million detailed image…

via the-decoder.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
Sunday, June 7, 2026’s edition
Saturday, June 6, 2026’s edition
The Decoder+1 source
RESEARCH · 2 sources

Sakana AI bets AI that improves itself can break the compute arms race of frontier labs

Sakana AI has launched a dedicated research lab for recursive self-improvement: AI that iteratively improves itself. The Japanese startup, co-founded by Transformer co-author Llion Jones, sees RSI as an alternative to…

via the-decoder.com·+2 sources+2 sourcesthe-decoder.comprimaryreddit.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
Thursday, June 4, 2026’s edition
Anthropic
CLAUDE · 1 source

Anthropic says Claude authored 80% of code merged into its codebase

Anthropic: Anthropic details its progress toward recursive self-improvement, and its implications, and says 80%+ of the code merged into its codebase is authored by Claude — Our progress toward recursive…

via anthropic.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
Wednesday, June 3, 2026’s edition
Reddit — AI Communities
AI · 1 source

AI model scaling and cognitive benchmarking research

New research studies demonstrate that model capabilities interact and cooperate significantly above a 3.5B parameter threshold, while also identifying fundamental cognitive blind spots in state-of-the-art models like…

via reddit.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
NVIDIA — AI Blog
AGENTS · 1 source

Nvidia research advances grasping, autonomous driving and agent

New NVIDIA Research breakthroughs show how training at scale — across gripper types, driving scenarios and virtual worlds — creates AI that generalizes to diverse applications.

via blogs.nvidia.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
Nytimes
SAFETY · 1 source

University of Toronto researchers claim to have developed a "worm" powered by open source AI that exploits known flaws and tailors attacks for each computer (Cade Metz/New York Times)

Cade Metz / New York Times: University of Toronto researchers claim to have developed a “worm” powered by open source AI that exploits known flaws and tailors attacks for each computer — Researchers…

via nytimes.com·Click to report a broken or paywalled link. Two distinct reports hide the row.
Tuesday, June 2, 2026’s edition