§ feed · storyline

AlphaProof + AlphaGeometry2 reach 1 point short of IMO Gold

Google DeepMind's AlphaProof and AlphaGeometry 2 solved four of six IMO 2024 problems, scoring one point below the gold medal threshold but taking far longer than human competitors.

Jul 26 · 03:15:56 · primary fetch · 1 source · updated Jul 26 · 03:15:56

The Search+Verifier approach highlights advances in neurosymbolic AI at the 2024 International Mathematical Olympiad (IMO). Google DeepMind's combination of AlphaProof and AlphaGeometry 2 solved four of the six IMO problems. AlphaProof is a fine-tuned Gemini model that uses an AlphaZero approach, and AlphaGeometry 2 was trained on significantly more synthetic data and adds a novel knowledge-sharing mechanism. Despite the impressive results, human judges noted that the AI took much longer than human competitors.

Meanwhile, Meta AI released Llama 3.1, with a 405B-parameter model and smaller variants, and Mistral AI launched Mistral Large 2, with 123B parameters and a 128k context window, which outperforms Llama 3.1 on coding tasks and multilingual benchmarks. Together, these releases mark progress in AI mathematical reasoning, model scaling, and multilingual capabilities.


§ sources · 1 publication · timeline below

news.smol.ai · AlphaProof + AlphaGeometry2 reach 1 point short of IMO Gold · primary · 03:15:56