§ feed · storyline

Fixing Gemma

Unsloth AI's Daniel Han fixes 8 bugs in Google's Gemma model that made it unstable for finetuning, improving its implementation for the community.

Mar 12 · 01:03:26 · primary fetch · 1 source · updated Mar 12 · 01:03:26

Google's Gemma model was found to be unstable for finetuning until Daniel Han of Unsloth AI fixed 8 bugs in its implementation. Yann LeCun explained the technical details of a pseudo-random bit sequence used for adaptive equalizers, while François Chollet discussed the low information bandwidth of the human visual system. Aravind Srinivas reported that Claude 3 Opus showed no hallucinations in extensive testing and outperformed GPT-4 and Mistral-Large in benchmarks.

Yann LeCun also reflected on ongoing AI progress toward human-level intelligence. The community is adapting its pipelines to work better with Claude models, and Aidan Clark shared the emotional side of ML development.

read full article on news.smol.ai ↗

§ sources · 1 publication · timeline below

news.smol.ai · Fixing Gemma · primary · 01:03:26