shipfeedAI news, curated daily

01:28:05 CET
21 MAY01:28:05shipfeed
pull to refreshlast sync
Just in — 30 new
§ feed · storyline

Claude 3 is officially America's Next Top Model

Claude 3 Opus tops blind Elo rankings above GPT-4 Turbo and Mistral Large, while Claude 3 Haiku sets a new cost-performance benchmark.

Mar 27 · · primary fetch1 sourceupdated Mar 27 ·

Claude 3 Opus outperforms GPT4T and Mistral Large in blind Elo rankings, with Claude 3 Haiku marking a new cost-performance frontier. Fine-tuning techniques like QLoRA on Mistral 7B and evolutionary model merging on HuggingFace models are highlighted. Public opinion shows strong opposition to ASI development. Research supervision opportunities in AI alignment are announced.

The Stable Diffusion 3 (SD3) release raises workflow concerns for tools like ComfyUI and automatic1111. Opus shows a 5% performance dip on OpenRouter compared to the Anthropic API. A new benchmark stresses LLM recall at long contexts, with Mistral 7B struggling and Qwen 72b performing well.

read full article on news.smol.ai
§ sources1 publication · timeline below
  1. news.smol.aiClaude 3 is officially America's Next Top Modelprimary