§ agents · storyline

Thinking Machines Lab ships its first model and argues interactivity is what OpenAI gets wrong about voice

Thinking Machines Lab releases its first multimodal AI model, processing audio, video, and text in 200-millisecond chunks and targeting OpenAI and Google in real-time voice interaction quality.

May 12 · 15:16:03 · primary fetch1 sourceupdated May 12 · 15:16:03

Mira Murati's start-up presents its first AI model and aims to free voice AI from the question-and-answer model. The model processes audio, video and text in 200-millisecond chunks in parallel and aims to beat OpenAI's GPT Realtime 2 and Google's Gemini Live in terms of interaction quality.

The article Thinking Machines Lab ships its first model and argues interactivity is what OpenAI gets wrong about voice appeared first on The Decoder.

read full article on the-decoder.com ↗

§ sources1 publication · timeline below

the-decoder.comThinking Machines Lab ships its first model and argues interactivity is what OpenAI gets wrong about voiceprimary15:16:03