§ feed · storyline

AI2 releases OLMo - the 4th open-everything LLM

AI2 releases OLMo, a fully open large language model available in 1B and 7B parameter sizes, with a 65B model forthcoming, emphasising open and reproducible research.

Feb 3 · 04:35:10 · primary fetch1 sourceupdated Feb 3 · 04:35:10

AI2 is gaining attention in 2024 with its new OLMo models, including 1B and 7B sizes and a 65B model forthcoming, emphasizing open and reproducible research akin to Pythia. The Miqu-70B model, especially the Mistral Medium variant, is praised for self-correction and speed optimizations. Discussions in TheBloke Discord covered programming language preferences, VRAM constraints for large models, and fine-tuning experiments with Distilbert-base-uncased.

The Mistral Discord highlighted challenges in the GPU shortage affecting semiconductor production involving TSMC, ASML, and Zeiss, debates on open-source versus proprietary models, and fine-tuning techniques including LoRA for low-resource languages. Community insights also touched on embedding chunking strategies and JSON output improvements.

read full article on news.smol.ai ↗

§ sources1 publication · timeline below

news.smol.aiAI2 releases OLMo - the 4th open-everything LLMprimary04:35:10