Google DeepMind: Google DeepMind details a Gemini-powered mouse pointer that understands what it is pointing at, allowing users to perform tasks without using text-heavy prompts. We are developing more seamless, intuitive ways to collaborate with AI. The mouse pointer has been a constant companion …
via techmeme.com
Introducing our new Cline CLI, built on our new SDK, with a snappy new TUI. Install:
```sh
npm install -g cline
```
For nightly builds:
```sh
npm install -g cline@nightly
```
via github.com
Thinking Machines Lab released TML-Interaction-Small, a 276B parameter multimodal interaction model with 200ms real-time processing for audio, video, and text.
via unite.ai
Reuters: The US DOD says it is deploying Mythos to find and patch software vulnerabilities across the US government, even as it works on a transition away from Anthropic — WASHINGTON, May 12 (Reuters) - The Pentagon is…
via techmeme.com
Mira Murati's start-up presents its first AI model and aims to free voice AI from the question-and-answer model. The model processes audio, video and text in 200-millisecond chunks in parallel and aims to beat OpenAI's…
via the-decoder.com
Nimbus builds production AI systems — internal tools, customer agents, retrieval pipelines — combining humans and AI end-to-end. From scoped pilot to production in 4–8 weeks.
With Gemini Intelligence, Google is introducing new AI features for Android that automate multi-step tasks, summarize web content, fill out forms, and turn spoken thoughts into polished text messages. The article…
via the-decoder.com
Fast mode for Claude Opus 4.7 is now available on AI Gateway in research preview. Fast mode delivers ~2.5x faster output token generation with full Opus 4.7 intelligence. This is an early, experimental feature. To…
via vercel.com
OpenAI releases three new realtime audio models via Realtime API: GPT-Realtime-2 with GPT-5 class reasoning, GPT-Realtime-Translate for live translation, and GPT-Realtime-Whisper for transcription, now out of beta.
via goml.io
OpenAI introduces workspace agents in ChatGPT, an evolution of GPTs for workplace tasks like reports, coding, and messaging, available in research preview for Business, Enterprise, Edu, and Teachers plans.
via openai.com
AWS launches Claude Platform on AWS, providing direct access to Anthropic’s Claude Platform via AWS accounts, with IAM authentication, billing through AWS Marketplace, supporting agents, tools, and various regions.
via aws.amazon.com
Patch Changes
- 725f2ed: feat(mcp): expose server instructions to be accessible through client
- 7281592: fix(mcp): use negotiated protocol version in transport request headers
via github.com
from shipfeed · weekly digest
The week in AI, in one short email.
Friday digest. The 10 most important shipping events. No promo, no AI-summary slop. Three minutes to read, unsubscribe in one click.
0.101.0 (2026-05-11)
Full Changelog: v0.100.0...v0.101.0
Features
- aws: Add AWS client for Claude Platform on AWS (1e70e3a)
Bug Fixes
- client: add missing f-string prefix in file type error message (06d109a)
Chores…
via github.com
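The bug fix listed in the 0.101.0 changelog above (a missing f-string prefix in an error message) is a common class of Python mistake. The sketch below only illustrates that class of bug in general. The function names and message text are hypothetical and are not taken from the SDK's source.

```python
# Hypothetical illustration of a missing f-string prefix.
# Neither function nor the message wording comes from the SDK itself.

def describe_without_prefix(file_type: str) -> str:
    # Bug: without the leading "f", the braces are kept literally
    # and the variable is never interpolated.
    return "Unsupported file type: {file_type}"


def describe_with_prefix(file_type: str) -> str:
    # Fix: the "f" prefix turns the literal into a formatted string,
    # so {file_type} is replaced by the argument's value.
    return f"Unsupported file type: {file_type}"


if __name__ == "__main__":
    print(describe_without_prefix("bmp"))
    print(describe_with_prefix("bmp"))
```

Running the script prints the uninterpolated placeholder for the first call and the actual value for the second, which is why this kind of bug tends to surface only when the error path is exercised.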
Tutorial on implementing Memori for agent-native memory in LLM apps, supporting persistent multi-user sessions, streaming, and async calls with isolated user contexts.
via marktechpost.com
NVIDIA AI released cuda-oxide, an experimental Rust-to-CUDA compiler backend that compiles SIMT GPU kernels directly to PTX without DSLs or C/C++ code.
via marktechpost.com
Superset on Vercel: Software development with AI started as a single engineer chatting with a single agent about a local repo. Today, developers direct fleets of agents in the cloud, but traditional tools were built for…
via vercel.com
MarkTechPost describes GitHub’s open-source Spec-Kit, which provides a structured spec-driven development workflow for AI coding agents (including multiple integrations) via a CLI and a command flow for planning and…
OpenAI launched GPT‑Realtime‑2 plus companion voice models (Translate and Whisper), positioning them for live, tool-using voice interactions. The update highlights built-in production features such as parallel tool…
via opentools.ai
Teradata announced an Autonomous Knowledge Platform for running AI agents continuously across cloud, on-prem, and hybrid environments. It includes AI Studio for building/managing/governing models and agents plus…
via completeaitraining.com
Google announced Gemini 3.1 Flash-Lite is generally available on the Gemini Enterprise Agent Platform, positioning it for ultra-low latency, high-volume agentic workloads such as tool calling and orchestration.
via cloud.google.com
OpenAI introduces GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper for live voice interactions, translation, and streaming transcription, featuring GPT-5-class reasoning and low-latency performance.
via itbrief.ie
Sam Sabin / Axios: OpenAI is rolling out GPT-5.5-Cyber, a security-focused variant of the model, in a limited preview capacity to vetted cybersecurity teams — The capabilities of the new models have sparked an urgent…
via techmeme.com
Russell Brandom / TechCrunch: Mozilla says Anthropic's Mythos Preview and other AI models helped it identify and ship 423 Firefox security bug fixes in April, compared to 31 a year earlier — When Anthropic unveiled its…
Anthropic releases dreaming (research preview), outcomes, multiagent orchestration, and webhooks for Claude Managed Agents to enhance agent improvement, task evaluation, and complex workflow handling.
via itbrief.news
How OpenAI runs Codex securely with sandboxing, approvals, network policies, and agent-native telemetry to support safe and compliant coding agent adoption.
via openai.com
DeepSeek-V4 makes million-token context a serving-systems problem. Together AI explores the inference work behind V4 on NVIDIA HGX B200, including compressed KV layouts, prefix caching, kernel maturity, and endpoint…
via together.ai
OpenAI released MRC, a new open networking protocol developed with AMD, Broadcom, Intel, Microsoft, and NVIDIA to handle network congestion and failures in large-scale AI supercomputer training clusters using RDMA over…
via marktechpost.com
Zyphra released ZAYA1-8B, a small MoE model with 760M active parameters trained on AMD hardware, outperforming larger models on math and coding benchmarks, available under Apache 2.0.
via marktechpost.com
OpenAI released GPT-Realtime-2, a voice model with GPT-5-class reasoning, tool use, interruption handling, and extended context windows up to 128K tokens, achieving top scores on Big Bench Audio and Conversational…
via news.smol.ai
OpenAI is shipping three new voice models—GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper—that can reason in real time, translate across 70+ languages, and transcribe live speech. GPT-Realtime-2 brings…
via the-decoder.com
Explore new realtime voice models in the OpenAI API that can reason, translate, and transcribe speech, enabling more natural and intelligent voice experiences.
MarkTechPost reports LightSeek Foundation’s MIT-licensed TokenSpeed inference engine (in preview) tailored for agentic workloads, claiming performance improvements over TensorRT-LLM on decode latency and throughput and…
via marktechpost.com
OpenAI expands Trusted Access for Cyber with GPT-5.5 and GPT-5.5-Cyber, helping verified defenders accelerate vulnerability research and protect critical infrastructure.
via openai.com
Parloa leverages OpenAI models to power scalable, voice-driven AI customer service agents, enabling enterprises to design, simulate, and deploy reliable, real-time interactions.
via openai.com
Zyphra released ZAYA1-8B, a reasoning Mixture-of-Experts model, claiming strong math and coding benchmark performance relative to its size; it’s available under an Apache 2.0 license on Hugging Face and as a serverless…
via marktechpost.com
Anthropic announced a new SpaceX compute partnership to significantly increase capacity for Claude products, doubling Claude Code's 5-hour rate limits for Pro, Max, Team, and Enterprise users, removing peak-hour limit…
via news.smol.ai
Multipath Reliable Connection — a new transport protocol proven first and optimized on NVIDIA Spectrum-X Ethernet hardware — is now open to the industry.
via blogs.nvidia.com
Mistral AI released Voxtral TTS, a 4B parameter model for multilingual voice cloning from 3 seconds of audio, outperforming competitors in evaluations.
via marktechpost.com
Perplexity launches Finance Search in the Agent API, combining licensed financial data, real-time market data, and cited sources for accurate financial agents.
via perplexity.ai
Release v5.8.0. New model additions: DeepSeek-V4. DeepSeek-V4 is the next-generation MoE (Mixture of Experts) language model from DeepSeek that introduces several architectural innovations over DeepSeek-V3. The…
via github.com
We're releasing ten new Cowork and Claude Code plugins, integrations with the Microsoft 365 suite, new connectors, and an MCP app for financial services and insurance organizations.
via anthropic.com
Perplexity launches Computer for Professional Finance, tailored for finance workflows with accurate data and integrations such as Teams, with Excel support upcoming.
via perplexity.ai
vLLM v0.20.1: This is a patch release on top of `v0.20.0`, primarily focused on DeepSeek V4 stabilization and performance improvements, along with several important bug fixes. DeepSeek V4 Base model support (#41006)…
via github.com