What's the latest in research?

The most recent research storyline on shipfeed is "GPT-5.6 system card shows Sol below threat level for Mythos use cases". shipfeed grouped 117 new research storylines from across the AI press in the past 7 days.

Which sources cover research?

The sources most active in shipfeed's research feed are arXiv — cs.AI, arXiv — cs.CL, The Decoder, HN Algolia — Research / arXiv, and Hugging Face — Blog.

How many research stories does shipfeed track?

shipfeed is tracking 1,496 research storylines in total — 117 updated in the past 7 days and 767 in the past 30 — each a deduplicated group of articles from its original sources.

How often is this page updated?

Continuously. shipfeed re-checks its research sources around the clock and regroups new coverage into deduplicated storylines; the last-updated time is shown at the top of this page.

Shipfeed. AI News Channel

storylines this week30 active

11:36:52The Decoder

CURSOR · 1 source

Deepmind reinvents mouse cursor for AI era with pointer engineering

Pointer Engineering: Deepmind wants to turn the mouse cursor into the key variable in context engineering. The article From Prompt to Pointer Engineering: Deepmind tries to reinvent the mouse cursor for the AI era…

via the-decoder.com

Saturday, June 27, 2026’s editionSaturday, June 27, 2026

11:23:42The Decoder

GPT · 1 source

OpenAI's GPT-5.6 Sol cheats on software tests more than prior models

Independent testing organization METR found that OpenAI's GPT-5.6 Sol cheated more than any publicly tested AI model before it, exploiting bugs in the test environment, extracting hidden solutions, and trying to cover…

via the-decoder.com

Saturday, June 6, 2026’s editionSaturday, June 6, 2026

15:57:52The Decoder

RESEARCH · 1 source

Sakana AI bets AI that improves itself can break the compute arms race of frontier labs

Sakana AI has launched a dedicated research lab for recursive self-improvement: AI that iteratively improves itself. The Japanese startup, co-founded by Transformer co-author Llion Jones, sees RSI as an alternative to…

via the-decoder.com

Thursday, June 4, 2026’s editionThursday, June 4, 2026

18:30:02Anthropic

CLAUDE · 1 source

Anthropic says Claude authored 80% of code merged into its codebase

Anthropic: Anthropic details its progress toward recursive self-improvement, and its implications, and says 80%+ of the code merged into its codebase is authored by Claude — Our progress toward recursive…

via anthropic.com

Tuesday, June 2, 2026’s editionTuesday, June 2, 2026

07:44:39Smol AI — Daily

RESEARCH · 1 source

Microsoft unveils MAI-Thinking-1 model and Surface RTX dev box

Microsoft introduced MAI-Thinking-1, a 35B parameter MoE model with 256K context, achieving 97% on AIME 2025 and outperforming Sonnet 4.6 in human preference tests. The broader 7-model MAI family spans reasoning, code…

via news.smol.ai

Saturday, May 30, 2026’s editionSaturday, May 30, 2026

14:04:25The Decoder

RESEARCH · 1 source

Terence Tao argues AI could bring division of labor to math for the first time in history

Mathematician Terence Tao describes how AI could reshape math research by enabling division of labor for the first time. Until now, researchers had to master every step themselves, from framing problems to verifying…

via the-decoder.com

Wednesday, May 20, 2026’s editionWednesday, May 20, 2026

02:00:00OpenAI — Blog

RESEARCH · 1 source

An OpenAI model has disproved a central conjecture in discrete geometry

An OpenAI model solved the 80-year-old unit distance problem, disproving a major conjecture in discrete geometry and marking a milestone in AI-driven mathematics.

via openai.com

Saturday, May 9, 2026’s editionSaturday, May 9, 2026

16:32:14The Decoder

GPT · 1 source

Fields Medalist: ChatGPT 5.5 Pro solved PhD-level math in hours

Fields Medalist Timothy Gowers had ChatGPT 5.5 Pro tackle open problems in number theory. The model improved an exponential bound to a polynomial one in under an hour. An MIT researcher involved calls the key idea…

via the-decoder.com

* sponsored·▶ nimbus

Need an agent shipped this quarter?

Nimbus builds production AI systems — internal tools, customer agents, retrieval pipelines — combining humans and AI end-to-end. From scoped pilot to production in 4–8 weeks.

Nimbus — talk to Nimbus →

Yesterday’s editionSunday, June 28, 2026

21:55:04Thezvi

GPT · 1 source

GPT-5.6 system card shows Sol below threat level for Mythos use cases

Zvi Mowshowitz / Don't Worry About the Vase: GPT-5.6 system card indicates Sol is well below the level of most worrisome Mythos use cases, suggesting all GPT-5.6 versions could be released without delay — While…

via thezvi.substack.com

Tuesday, June 23, 2026’s editionTuesday, June 23, 2026

10:18:48Google News — AI Products & Releases

SAFETY · 1 source

OpenAI's Cybersecurity AI Surpasses Anthropic's Mythos 5

via Google News — AI Products & Releases

Sunday, June 21, 2026’s editionSunday, June 21, 2026

16:05:39Cybernews

RESEARCH · 1 source

Anthropic poaches Nobel Prize-winning scientist from Google DeepMind

Anthropic poaches Nobel Prize-winning scientist from Google DeepMind Cybernews

via Cybernews

07:13:38Indiatimes

RESEARCH · 1 source

Nobel Laureate John Jumper jumps to Anthropic from Google’s DeepMind

Nobel Laureate John Jumper jumps to Anthropic from Google’s DeepMind Indiatimes

via Indiatimes

Thursday, June 18, 2026’s editionThursday, June 18, 2026

15:15:57Tech Times

RESEARCH · 1 source

AI model boosts Chan-Lam yields across 10,080 reactions

AI Drug Discovery Chemistry Hits Wet Lab: GPT-5.4 Boosts Chan-Lam Yields in 10,080 Reactions Tech Times

via Tech Times

10:00:00OpenAI — Blog

RESEARCH · 1 source

Using AI to help physicians diagnose rare genetic diseases affecting children

Researchers used an OpenAI reasoning model to help diagnose rare diseases, identifying 18 new diagnoses in previously unsolved cases.

via openai.com

Wednesday, June 10, 2026’s editionWednesday, June 10, 2026

19:38:24The Decoder

SAFETY · 1 source

Anthropic study shows AI needs hours, not weeks, to build exploits from security patches

Anthropic's security team found that its Mythos Preview AI model can turn security patches for Firefox and the Windows kernel into working exploits within hours, for a few thousand dollars and no specialized knowledge…

via the-decoder.com

Monday, June 1, 2026’s editionMonday, June 1, 2026

13:00:00Ars Technica — AI

RESEARCH · 1 source

An OpenAI model solved a famous math problem that stumped humans for 80 years

I tried to explain OpenAI’s solution more clearly than OpenAI did.

via arstechnica.com

06:45:07NVIDIA — AI Blog

RESEARCH · 1 source

How Cosmos 3 Helps Physical AI Think Before It Acts

The new, open NVIDIA world foundation model brings vision reasoning, multimodal generation and action prediction together to help robots, autonomous vehicles and vision AI agents think before acting in the real world.

via blogs.nvidia.com

Wednesday, May 27, 2026’s editionWednesday, May 27, 2026

14:26:32Axios

RESEARCH · 1 source

Biohub releases world model of protein biology to researchers

Ina Fried / Axios: Biohub, a Mark Zuckerberg- and Priscilla Chan-funded institute, releases “a world model of protein biology” to researchers for prediction, design, and discovery — Biohub, the Mark…

via axios.com

Thursday, May 21, 2026’s editionThursday, May 21, 2026

18:11:50The Decoder

RESEARCH · 1 source

OpenAI's reasoning model disproves decades-old Erdős conjecture

A reasoning model from OpenAI has disproved a conjecture by mathematician Paul Erdős on unit-distance geometry that stood open since 1946 - using tools from algebraic number theory that experts never expected in this…

via the-decoder.com

Wednesday, May 20, 2026’s editionWednesday, May 20, 2026

17:10:05HN Algolia — Research / arXiv

RESEARCH · 1 source

Stable Audio 3

via arxiv.org

Wednesday, May 13, 2026’s editionWednesday, May 13, 2026

23:00:56AI Security Institute

SAFETY · 1 source

Mythos Preview first to complete both AISI cyber ranges

AI Security Institute: Mythos Preview is the first AI model to complete both of AISI's cyber ranges, which measure models' cyberattack capabilities; GPT-5.5 solved only one of them — In February 2026, we internally…

via techmeme.com

Tuesday, May 12, 2026’s editionTuesday, May 12, 2026

16:40:07Google DeepMind — Blog

AGENTS · 1 source

Co-Scientist: A multi-agent AI partner to accelerate research

Introducing Co-Scientist, a collaborative AI partner built with Gemini to help researchers accelerate scientific breakthroughs.

via deepmind.google

Thursday, May 7, 2026’s editionThursday, May 7, 2026

19:54:02HN Algolia — Claude / Anthropic

CLAUDE · 1 source

Natural Language Autoencoders: Turning Claude's Thoughts into Text

via anthropic.com

Friday, June 26, 2026’s editionFriday, June 26, 2026

03:12:30Latent Space

RESEARCH · 1 source

OpenAI Codex output tokens surge across divisions since November

It's happening.

via latent.space

* sponsored·▶ nimbus

Need an agent shipped this quarter?

Nimbus builds production AI systems — internal tools, customer agents, retrieval pipelines — combining humans and AI end-to-end. From scoped pilot to production in 4–8 weeks.

Nimbus — talk to Nimbus →

Wednesday, June 24, 2026’s editionWednesday, June 24, 2026

18:51:52Google Research — Blog

RESEARCH · 1 source

Thinking to recall: How reasoning unlocks parametric knowledge in LLMs

Generative AI

via research.google

17:55:01Theregister

SAFETY · 1 source

Nature paper challenges Microsoft quantum claims over coding errors

Thomas Claburn / The Register: Nature publishes a peer-reviewed paper alleging that Microsoft's 2025 quantum breakthrough claims were based on “basic Python errors” and data cherry-picking — Nature…

via theregister.com

04:15:01Nytimes

SAFETY · 1 source

NSA's red-teaming tests found Mythos 5 identified classified system

New York Times: The NSA was red-teaming Mythos 5 before losing access amid the Anthropic dispute; the tests showed Mythos identified cybersecurity flaws in classified systems — A recent episode underscored the…

via nytimes.com

Tuesday, June 23, 2026’s editionTuesday, June 23, 2026

00:07:03R&D World

RESEARCH · 1 source

AI chemist improves stubborn coupling reaction

OpenAI and Molecule.one report a near-autonomous AI chemist that improved a stubborn coupling reaction R&D World

via R&D World

Sunday, June 21, 2026’s editionSunday, June 21, 2026

19:53:58WinBuzzer

AGENTS · 1 source

Google DeepMind Tests AI Controls on One Million Agent Tasks

Google DeepMind Tests AI Controls on One Million Agent Tasks WinBuzzer

via WinBuzzer

Saturday, June 20, 2026’s editionSaturday, June 20, 2026

18:39:57TechCrunch

SAFETY · 1 source

Nobel laureate John Jumper is leaving DeepMind for rival Anthropic

Nobel laureate John Jumper is leaving DeepMind for rival Anthropic TechCrunch

via TechCrunch