§ feed · storyline

Llama-3-70b is GPT-4-level Open Model

Meta releases Llama 3 in 8B and 70B parameter versions with 8K context support, outperforming Llama 2 and Mistral 7B, with Groq serving the 70B model at up to 800 tokens per second.

Apr 20 · 04:21:27 · primary fetch1 sourceupdated Apr 20 · 04:21:27

Meta has released Llama 3, their most capable open large language model with 8B and 70B parameter versions supporting 8K context length and outperforming previous models including Llama 2 and Mistral 7B. Groq serves the Llama 3 70B model at 500-800 tokens/second, making it the fastest GPT-4-level token source. Discussions highlight AI scaling challenges with Elon Musk stating that training Grok 3 will require 100,000 Nvidia H100 GPUs, and AWS planning to acquire 20,000 B200 GPUs for a 27 trillion parameter model.

Microsoft unveiled VASA-1 for lifelike talking face generation, while Stable Diffusion 3 and its extensions received mixed impressions. Concerns about AI energy usage and political bias in AI were also discussed.

read full article on news.smol.ai ↗

§ sources1 publication · timeline below

news.smol.aiLlama-3-70b is GPT-4-level Open Modelprimary04:21:27