shipfeedAI news, curated daily

01:27:32 CET
21 MAY01:27:32shipfeed
pull to refreshlast sync
Just in — 30 new
§ feed · storyline

OpenAI Voice Mode Can See Now

OpenAI launches Realtime Video capability for Voice Mode, enabling the assistant to process live video input alongside audio.

Dec 18 · · primary fetch1 sourceupdated Dec 18 ·

OpenAI launched Realtime Video shortly after Gemini, which led to less impact due to Gemini's earlier arrival with lower cost and fewer rate limits. Google DeepMind released Gemini 2.0 Flash featuring enhanced multimodal capabilities and real-time streaming. Anthropic introduced Clio, a system analyzing real-world usage of Claude models. Together Computing acquired CodeSandbox to launch a code interpreter tool. Discussions highlighted Meta's Llama 3.3-70B for its advanced roleplay and prompt handling abilities, outperforming models like Mistral Large and GPT-4o in expressiveness and censorship.

The AI community also engaged in humorous takes on AI outages and model competition, with ChatGPT adding a Santa mode for holiday interactions. "Anthropic is capturing the developer ecosystem, Gemini has AI enthusiast mindshare, ChatGPT reigns over AI dabblers" was a noted observation from the community.

read full article on news.smol.ai
§ sources1 publication · timeline below
  1. news.smol.aiOpenAI Voice Mode Can See Nowprimary