shipfeedAI news, curated daily

00:39:09 CET
21 MAY00:39:09shipfeed
pull to refreshlast sync
Just in — 30 new
§ feed · storyline

OpenAI Titan XPU: 10GW of self-designed chips with Broadcom

OpenAI finalizes a custom ASIC chip design with Broadcom targeting 10GW of inference compute, part of a broader roadmap to reach 250GW total capacity.

Oct 13 · · primary fetch1 sourceupdated Oct 13 ·

OpenAI is finalizing a custom ASIC chip design to deploy 10GW of inference compute, complementing existing deals with NVIDIA (10GW) and AMD (6GW). This marks a significant scale-up from OpenAI's current 2GW compute, aiming for a roadmap of 250GW total, which is half the energy consumption of the US. Greg from OpenAI highlights the shift of ChatGPT from interactive use to always-on ambient agents requiring massive compute, emphasizing the challenge of building chips for billions of users.

The in-house ASIC effort was driven by the need for tailored designs after limited success influencing external chip startups. Broadcom's stock surged 10% on the news. Additionally, InferenceMAX reports improved ROCm stability and nuanced performance comparisons between AMD MI300X and NVIDIA H100/H200 on llama-3-70b FP8 workloads, with RL training infrastructure updates noted.

read full article on news.smol.ai
§ sources1 publication · timeline below
  1. news.smol.aiOpenAI Titan XPU: 10GW of self-designed chips with Broadcomprimary