shipfeedAI news, curated daily

01:16:52 CET
21 MAY01:16:52shipfeed
pull to refreshlast sync
Just in — 30 new
§ feed · storyline

12/8/2023

Three new AI models launch in the same week: Mistral's Mixtral 8x7B MoE, Together's Mamba models up to 3B, and Stanford Hazy Research's StripedHyena 7B subquadratic attention model.

Dec 8 · · primary fetch1 sourceupdated Dec 8 ·

Three new AI models are highlighted: Mistral's 8x7B MoE model (Mixtral), Mamba models up to 3B by Together, and StripedHyena 7B, a competitive subquadratic attention model from Stanford's Hazy Research. Discussions on Anthropic's Claude 2.1 focus on its prompting technique and alignment challenges. The Gemini AI from Google is noted as potentially superior to GPT-4.

The community also explores Dreambooth for image training and shares resources like the DialogRPT-human-vs-machine model on Hugging Face. Deployment challenges for large language models, including CPU performance and GPU requirements, are discussed with references to Falcon 180B and transformer batching techniques. User engagement includes meme sharing and humor.

read full article on news.smol.ai
§ sources1 publication · timeline below
  1. news.smol.ai12/8/2023primary