§ feed · storyline

12/25/2023: Nous Hermes 2 Yi 34B for Christmas

Nous Research releases Nous Hermes 2 on the Yi 34B base model, claiming top performance among open models against Mixtral, DeepSeek, and Qwen.

Dec 26 · 08:45:27 · primary fetch1 sourceupdated Dec 26 · 08:45:27

Teknium released Nous Hermes 2 on Yi 34B, positioning it as a top open model compared to Mixtral, DeepSeek, and Qwen. Apple introduced Ferret, a new open-source multimodal LLM. Discussions in the Nous Research AI Discord focused on AI model optimization and quantization techniques like AWQ, GPTQ, and AutoAWQ, with insights on proprietary optimization and throughput metrics.

Additional highlights include the addition of NucleusX Model to transformers, a 30B model with 80 MMLU, and the YAYI 2 language model by Wenge Technology trained on 2.65 trillion tokens. "AutoAWQ outperforms vLLM up to batch size 8" was noted, and proprietary parallel decoding and tensor parallelization across GPUs were discussed for speed improvements.

read full article on news.smol.ai ↗

§ sources1 publication · timeline below

news.smol.ai12/25/2023: Nous Hermes 2 Yi 34B for Christmasprimary08:45:27