§ feed · storyline

Introducing Active CPU pricing for Fluid compute

Vercel introduces Active CPU pricing for Fluid compute, charging CPU rates only during active code execution to reduce costs for I/O-bound workloads such as AI inference and agent servers.

Jun 25 · 15:00:00 · primary fetch1 sourceupdated Jun 25 · 15:00:00

exists for a new class of workloads. I/O bound backends like AI inference, agents, MCP servers, and anything that needs to scale instantly, but often remains idle between operations. These workloads do not follow traditional, quick request-response patterns. They’re long-running, unpredictable, and use cloud resources in new ways.Fluid compute on Vercel, helping teams cut costs by up to 85% through optimizations like in-function concurrency.Fluid quickly became the default compute model Today, we’re taking the efficiency and cost savings further with a new pricing model: you pay CPU rates only when your code is actively using CPU. Read more

read full article on vercel.com ↗

§ sources1 publication · timeline below

vercel.comIntroducing Active CPU pricing for Fluid computeprimary15:00:00