shipfeedAI news, curated daily

23:06:07 CET
20 MAY23:06:07shipfeed
pull to refreshlast sync
Just in — 30 new
§ feed · storyline

OpenAI Realtime API and other Dev Day Goodies

OpenAI launches the gpt-4o-realtime-preview API with text and audio processing, voice activity detection, function calling, and partnerships with LiveKit, Agora, and Twilio for voice agent use cases.

Oct 2 · · primary fetch1 sourceupdated Oct 2 ·

OpenAI launched the gpt-4o-realtime-preview Realtime API featuring text and audio token processing with pricing details and future plans including vision and video support. The API supports voice activity detection modes, function calling, and ephemeral sessions with auto-truncation for context limits. Partnerships with LiveKit, Agora, and Twilio enhance audio components and AI virtual agent voice calls.

Additionally, OpenAI introduced vision fine-tuning with only 100 examples improving mapping accuracy for Grab and RPA success for Automat. Model distillation and prompt caching features were also announced, including free eval inference for users opting to share data.

read full article on news.smol.ai
§ sources1 publication · timeline below
  1. news.smol.aiOpenAI Realtime API and other Dev Day Goodiesprimary