§ feed · storyline

Cohere's Command A claims #3 open model spot (after DeepSeek and Gemma)

Cohere's Command A, a 111B open-weight model with a 256K context window, ranks third on the LMArena leaderboard behind DeepSeek and Gemma.

Mar 18 · 01:28:53 · primary fetch1 sourceupdated Mar 18 · 01:28:53

Cohere's Command A model has solidified its position on the LMArena leaderboard, featuring an open-weight 111B parameter model with an unusually long 256K context window and competitive pricing. Mistral AI released the lightweight, multilingual, and multimodal Mistral AI Small 3.1 model, optimized for single RTX 4090 or Mac 32GB RAM setups, with strong performance on instruct and multimodal benchmarks.

The new OCR model SmolDocling offers fast document reading with low VRAM usage, outperforming larger models like Qwen2.5VL. Discussions highlight the importance of system-level improvements over raw LLM advancements, and MCBench is recommended as a superior AI benchmark for evaluating model capabilities across code, aesthetics, and awareness.

read full article on news.smol.ai ↗

§ sources1 publication · timeline below

news.smol.aiCohere's Command A claims #3 open model spot (after DeepSeek and Gemma)primary01:28:53