§ feed · storyline

OLMo 2 - new SOTA Fully Open LLM

AI2 releases OLMo-2, a fully open LLM trained on 5T tokens that matches Llama 3.1 8B performance, using learning rate annealing and the Dolmino dataset.

Nov 27 · 06:17:18 · primary fetch1 sourceupdated Nov 27 · 06:17:18

AI2 has updated OLMo-2 to roughly Llama 3.1 8B equivalent, training with 5T tokens and using learning rate annealing and new high-quality data (Dolmino). They credit Tülu 3 and its "Reinforcement Learning with Verifiable Rewards" approach. On Reddit, Qwen2.5-72B instruct model shows near lossless performance with AutoRound 4-bit quantization, available on HuggingFace in 4-bit and 2-bit versions, with discussions on MMLU benchmark and quantization-aware training.

HuggingFace released SmolVLM, a 2B parameter vision-language model running efficiently on consumer GPUs, supporting fine-tuning on Google Colab and demonstrating strong OCR capabilities with adjustable resolution and quantization options.

read full article on news.smol.ai ↗

§ sources1 publication · timeline below

news.smol.aiOLMo 2 - new SOTA Fully Open LLMprimary06:17:18