shipfeed · AI news, curated daily


Qwen3.5-397B-A17B: the smallest Open-Opus class, very efficient model

Alibaba releases Qwen3.5-397B-A17B, an open-weight multimodal MoE model with 256K token context, 201-language support, and a hybrid linear attention architecture under Apache-2.0.

Feb 16 · primary fetch · 1 source · updated Feb 16

Alibaba released Qwen3.5-397B-A17B, an open-weight model with native multimodality, spatial intelligence, and a hybrid linear attention + sparse MoE architecture. It supports 201 languages and context windows of up to 256K tokens. About 17B of its 397B parameters are active at a time, a sparsity ratio of about 4.3%, and the model improves on earlier releases such as Qwen3-Max and Qwen3-VL. Community discussion highlighted the Gated Delta Networks layers, which enable efficient inference despite the large model size (~800GB in BF16), and reported successful local runs on Apple Silicon using quantization.
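The sparsity ratio and the BF16 size quoted above follow from the parameter counts in the model name (397B total, 17B active). A minimal sketch of that arithmetic is below; the helper names are illustrative, and the 4-bit figure is an assumption, since the summary does not say which quantization the Apple Silicon runs used. The estimate covers weights only and ignores quantization metadata, KV cache, and activations.

```python
# Back-of-the-envelope arithmetic for Qwen3.5-397B-A17B.
# Parameter counts come from the model name; everything else is illustrative.

TOTAL_PARAMS = 397e9   # 397B total parameters
ACTIVE_PARAMS = 17e9   # 17B active parameters (the "A17B" suffix)


def active_ratio(active: float, total: float) -> float:
    """Fraction of parameters that are active."""
    return active / total


def weight_footprint_gb(params: float, bits_per_param: float) -> float:
    """Weights-only size in decimal gigabytes (1 GB = 1e9 bytes)."""
    return params * bits_per_param / 8 / 1e9


ratio = active_ratio(ACTIVE_PARAMS, TOTAL_PARAMS)
bf16_gb = weight_footprint_gb(TOTAL_PARAMS, 16)  # BF16 uses 16 bits per weight
q4_gb = weight_footprint_gb(TOTAL_PARAMS, 4)     # assumed 4-bit quantization

print(f"active ratio: {ratio:.1%}")        # about 4.3%
print(f"BF16 weights: {bf16_gb:.0f} GB")   # 794 GB, i.e. roughly 800GB
print(f"4-bit weights: {q4_gb:.1f} GB")    # 198.5 GB under the 4-bit assumption
```

Under these assumptions, a 4-bit quantization cuts the weights to about a quarter of the BF16 size, which gives a rough sense of why quantized local runs are feasible on high-memory machines.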

The hosted API version, Qwen3.5-Plus, supports a 1M-token context and integrates search and code interpreter features. The release follows model refreshes from other Chinese labs, including Z.ai, MiniMax, and Kimi, and is expected to be the last major release before DeepSeek v4. The model is licensed under Apache-2.0. Separately, the same news issue notes that Pete Steinberger is joining OpenAI.

§ sources · 1 publication
  1. news.smol.ai · Qwen3.5-397B-A17B: the smallest Open-Opus class, very efficient model · primary