§ models · storyline

Microsoft unveils MAI-Thinking-1 model and Surface RTX dev box

Microsoft unveils MAI-Thinking-1, a 35B MoE model with 256K context scoring 97% on AIME 2025, alongside a seven-model MAI family and the Surface RTX Spark Dev Box for local inference.

Jun 2 · 07:44:39 · primary fetch2 sourcesupdated Jun 3 · 07:49:02

Microsoft introduced MAI-Thinking-1, a 35B parameter MoE model with 256K context, achieving 97% on AIME 2025 and outperforming Sonnet 4.6 in human preference tests. The broader 7-model MAI family spans reasoning, code, image, speech, and voice, with third-party availability on OpenRouter, fal, and Baseten. The detailed 109-page technical report revealed insights on scaling, MFU, RL/post-training, and data curation, highlighting no third-party distillation and advanced prompt optimization techniques. Microsoft emphasized agent-native devices and local inference with projects like Project Solara / Scout and the Surface RTX Spark Dev Box, alongside software innovations such as the Copilot desktop app and MAI-Code-1-Flash integration.

Meanwhile, local-first computer-use agents like Holo 3.1 (Qwen-based, 0.8B to 35B parameters) support laptops and small workstations with optimized formats and strong benchmark results. Desktop shells for agents, including Hermes Desktop, Devin Desktop, and agent-neutral approaches compatible with Devin, Claude Code, and Codex, are proliferating, with hybrid local/cloud execution becoming the default architecture as seen in Perplexity Computer's hybrid…

read full article on news.smol.ai ↗

§ sources2 publications · timeline below

news.smol.aiMicrosoft Build: MAI-Thinking-1 and MAI Family models, Surface RTX Spark Dev Box, and OpenClaw in Windowsprimary07:44:39
latent.spaceMicrosoft Build: MAI-Thinking-1 and MAI Family models07:49:02

§ how this story moved

07:44:39primary — Smol AI — Daily publishes the launch post.
07:49:02Latent Space picks up coverage.