§ feed · storyline
Rethinking how we measure AI intelligence
Game Arena launches as an open-source platform for head-to-head evaluation of AI models in competitive environments with defined winning conditions.
Game Arena is a new, open-source platform for rigorous evaluation of AI models. It allows for head-to-head comparison of frontier systems in environments with clear winning conditions.
§ sources1 publication · timeline below
- deepmind.googleRethinking how we measure AI intelligenceprimary