§ feed · storyline

MiniMax-M2.5: SOTA coding, search, toolcalls, $1/hour

MiniMax-M2.5 launches as open source with an agent-native RL framework trained across 200k+ environments, achieving 80.2% SWE-Bench Verified and priced at $1 per hour at 100 tokens per second.

Feb 13 · 06:44:39 · primary fetch1 sourceupdated Feb 13 · 06:44:39

MiniMax-M2.5 is now open source, featuring an "agent-native" reinforcement learning framework called Forge trained across 200k+ RL environments for coding, tool use, and workflows. It boasts strong benchmark scores like 80.2% SWE-Bench Verified and emphasizes cost-efficiency with claims like "$1 per hour at 100 tps" and good on-device performance. The Forge RL system uses multi-level prefix caching and high rollout compute share (~60%) to generate millions of trajectories daily.

Independent reviews note improved stability and multi-turn viability but high token usage. The ecosystem rapidly adopted MiniMax-M2.5 with quantized releases including 2-bit GGUF and INT4 formats. Meanwhile, Together markets GLM-5 as a leading open-source model for long-horizon agents with 77.8% SWE-Bench Verified and MoE efficiency using DeepSeek Sparse Attention.

read full article on news.smol.ai ↗

§ sources1 publication · timeline below

news.smol.aiMiniMax-M2.5: SOTA coding, search, toolcalls, $1/hourprimary06:44:39