shipfeedAI news, curated daily

23:56:50 CET
20 MAY23:56:50shipfeed
pull to refreshlast sync
Just in — 30 new
§ feed · storyline

Gemini 3.1 Pro: 2x 3.0 on ARC-AGI 2

Google releases Gemini 3.1 Pro as a developer preview across Gemini, NotebookLM, and Vertex AI, scoring 77.1% on ARC-AGI-2 and 80.6% on SWE-Bench Verified.

Feb 19 · · primary fetch1 sourceupdated Feb 19 ·

Google released Gemini 3.1 Pro, a developer preview integrated across the Gemini app, NotebookLM, Gemini API / AI Studio, and Vertex AI, highlighting a significant reasoning improvement with ARC-AGI-2 = 77.1% and strong coding and agentic-tool benchmarks like SWE-Bench Verified = 80.6%. Independent evaluators such as Artificial Analysis and Arena confirmed top-tier performance and cost efficiency, though community reactions included excitement about practical gains, skepticism about benchmark targeting, and concerns over rollout inconsistencies.

The release emphasizes the same core intelligence powering Gemini 3 Deep Think scaled for practical use, with notable mentions from leaders like @sundarpichai, @demishassabis, and @JeffDean.

read full article on news.smol.ai
§ sources1 publication · timeline below
  1. news.smol.aiGemini 3.1 Pro: 2x 3.0 on ARC-AGI 2primary