§ feed · storyline
Scaling Kubernetes to 7,500 nodes
OpenAI scales Kubernetes clusters to 7,500 nodes to support large model training for GPT-3, CLIP, and DALL·E as well as smaller iterative research workloads.
We’ve scaled Kubernetes clusters to 7,500 nodes, producing a scalable infrastructure for large models like GPT-3, CLIP, and DALL·E, but also for rapid small-scale iterative research such as Scaling Laws for Neural Language Models.
§ sources1 publication · timeline below
- openai.comScaling Kubernetes to 7,500 nodesprimary