shipfeedAI news, curated daily

23:05:42 CET
20 MAY23:05:42shipfeed
pull to refreshlast sync
Just in — 30 new
§ feed · storyline

OLMo 2 - new SOTA Fully Open LLM

AI2 releases OLMo-2, a fully open LLM trained on 5T tokens that matches Llama 3.1 8B performance, using learning rate annealing and the Dolmino dataset.

Nov 27 · · primary fetch1 sourceupdated Nov 27 ·

AI2 has updated OLMo-2 to roughly Llama 3.1 8B equivalent, training with 5T tokens and using learning rate annealing and new high-quality data (Dolmino). They credit Tülu 3 and its "Reinforcement Learning with Verifiable Rewards" approach. On Reddit, Qwen2.5-72B instruct model shows near lossless performance with AutoRound 4-bit quantization, available on HuggingFace in 4-bit and 2-bit versions, with discussions on MMLU benchmark and quantization-aware training.

HuggingFace released SmolVLM, a 2B parameter vision-language model running efficiently on consumer GPUs, supporting fine-tuning on Google Colab and demonstrating strong OCR capabilities with adjustable resolution and quantization options.

read full article on news.smol.ai
§ sources1 publication · timeline below
  1. news.smol.aiOLMo 2 - new SOTA Fully Open LLMprimary