shipfeedAI news, curated daily

02:04:22 CET
21 MAY02:04:22shipfeed
pull to refreshlast sync
Just in — 30 new
§ feed · storyline

Apple's OpenELM beats OLMo with 50% of its dataset, using DeLighT

Apple releases OpenELM, an open large language model in sizes from 270M to 3B parameters, using a layer-wise scaling architecture that outperforms OLMo on half the training data.

Apr 26 · · primary fetch1 sourceupdated Apr 26 ·

Apple advances its AI presence with the release of OpenELM, its first relatively open large language model available in sizes from 270M to 3B parameters, featuring a novel layer-wise scaling architecture inspired by the DeLight paper. Meanwhile, Meta's LLaMA 3 family pushes context length boundaries with models supporting over 160K tokens and an 8B-Instruct model with 262K context length released on Hugging Face, alongside performance improvements in quantized versions.

A new paper on AI alignment highlights KTO as the best-performing method, with sensitivity to training data volume noted. In AI ethics and regulation, former Google CEO Eric Schmidt warns about the risks of open-source AI empowering bad actors and geopolitical rivals, while a U.S. proposal aims to enforce "Know Your Customer" rules to end anonymous cloud usage.

read full article on news.smol.ai
§ sources1 publication · timeline below
  1. news.smol.aiApple's OpenELM beats OLMo with 50% of its dataset, using DeLighTprimary