shipfeedAI news, curated daily

23:05:45 CET
20 MAY23:05:45shipfeed
pull to refreshlast sync
Just in — 30 new
§ feed · storyline

Anthropic's Claude Opus 4.7

Anthropic releases Claude Opus 4.7 with a new tokenizer, xhigh reasoning tier, and benchmark results including 87.6% on SWE-bench Verified and 69.4% on TerminalBench.

Apr 16 · · primary fetch1 sourceupdated Apr 16 ·

Anthropic launched Claude Opus 4.7, its most capable Opus model yet, featuring stronger coding and agentic performance, a new tokenizer, and improved long-context handling with a new xhigh reasoning tier. Benchmarks show substantial gains, including SWE-bench Pro 64.3%, SWE-bench Verified 87.6%, and TerminalBench 69.4%, with top rankings on Vals Index and GDPval-AA. Technical changes include a new tokenizer and increased image input resolution to 3.75MP. Some long-context benchmarks showed mixed results, with a shift in focus from MRCR to Graphwalks.

Adoption was rapid across tools like Cursor, VS Code, Replit Agent, and Perplexity. Meanwhile, OpenAI expanded Codex into a broader computer agent with Mac computer use, in-app browser, image generation/editing, 90+ plugins, multi-terminal support, SSH remote devbox access, and richer file previews. A new vertical life-sciences model, GPT-Rosalind, was also introduced.

read full article on news.smol.ai
§ sources1 publication · timeline below
  1. news.smol.aiAnthropic's Claude Opus 4.7primary