Gemini 3.1 Pro: 2x 3.0 on ARC-AGI 2
Google releases Gemini 3.1 Pro as a developer preview across Gemini, NotebookLM, and Vertex AI, scoring 77.1% on ARC-AGI-2 and 80.6% on SWE-Bench Verified.
Google released Gemini 3.1 Pro, a developer preview integrated across the Gemini app, NotebookLM, Gemini API / AI Studio, and Vertex AI, highlighting a significant reasoning improvement with ARC-AGI-2 = 77.1% and strong coding and agentic-tool benchmarks like SWE-Bench Verified = 80.6%. Independent evaluators such as Artificial Analysis and Arena confirmed top-tier performance and cost efficiency, though community reactions included excitement about practical gains, skepticism about benchmark targeting, and concerns over rollout inconsistencies.
The release emphasizes the same core intelligence powering Gemini 3 Deep Think scaled for practical use, with notable mentions from leaders like @sundarpichai, @demishassabis, and @JeffDean.
- news.smol.aiGemini 3.1 Pro: 2x 3.0 on ARC-AGI 2primary