§ feed · storyline

not much happened today

Anthropic, Alibaba, and others release several model updates including Claude Opus/Sonnet 4.6, Qwen 3.5, and GLM-5, alongside ongoing debate over benchmark reliability.

Feb 18 · 06:44:39 · primary fetch1 sourceupdated Feb 18 · 06:44:39

Anthropic released Claude Opus/Sonnet 4.6, showing a significant intelligence index jump but with increased token usage and cost. Anthropic also shared insights on AI agent autonomy, highlighting human-in-the-loop prevalence and software engineering tool calls. Alibaba launched Qwen 3.5 with discussions on reasoning efficiency and token bloat, plus open-sourced Qwen3.5-397B-A17B FP8 weights.

The GLM-5 technical report introduced asynchronous agent reinforcement learning and compute-efficient techniques. Rumors about Gemini 3.1 Pro suggest longer reasoning capabilities, while MiniMax M2.5 appeared on community leaderboards. The community debates benchmark reliability and model performance nuances.

read full article on news.smol.ai ↗

§ sources1 publication · timeline below

news.smol.ainot much happened todayprimary06:44:39