shipfeedAI news, curated daily

00:32:43 CET
21 MAY00:32:43shipfeed
pull to refreshlast sync
Just in — 30 new
§ feed · storyline

AI Engineer Code Summit

Google DeepMind's Gemini 3 Pro image model debuts with 4K visuals and fine-grained editing, outperforming GPT-5 on the CritPt physics benchmark while showing regressions in transcription and writing tasks.

Nov 21 · · primary fetch1 sourceupdated Nov 21 ·

The recent AIE Code Summit showcased key developments including Google DeepMind's Gemini 3 Pro Image model, Nano Banana Pro, which features enhanced text rendering, 4K visuals, and fine-grained editing capabilities. Community feedback highlights its strong performance in design and visualization tasks, with high user preference scores. Benchmarking updates reveal the new CritPt physics frontier benchmark where Gemini 3 Pro outperforms GPT-5, though AI still lags on complex unseen research problems.

Agentic task evaluations show varied time horizons and performance gaps between open-weight and closed frontier models, emphasizing ongoing challenges in AI research and deployment. "Instruction following remains jagged for some users," and model fit varies by use case, with Gemini 3 excelling in UI and code tasks but showing regressions in transcription and writing fidelity.

read full article on news.smol.ai
§ sources1 publication · timeline below
  1. news.smol.aiAI Engineer Code Summitprimary