§ feed · storyline

From DeepSeek V3 to V3.2: Architecture, Sparse Attention, and RL Updates

DeepSeek updates its V3 open-weight model to V3.2 with changes to architecture, sparse attention mechanisms, and reinforcement learning training.

Dec 3 · 13:03:33 · primary fetch · 1 source · updated Dec 3 · 13:03:33

Understanding How DeepSeek's Flagship Open-Weight Models Evolved

§ sources · 1 publication · timeline below