§ feed · storyline

Cursor partners with Together AI for fast in-editor inference

Together AI partners with Cursor to build a real-time inference stack using NVIDIA Blackwell hardware and FP4/TensorRT quantization to keep in-editor agents fast and low-latency.

Jan 13 · 01:00:00 · primary fetch1 sourceupdated Jan 13 · 01:00:00

Together AI teamed with Cursor to build the real-time inference stack that keeps in-editor agents fast and reliable. They productionized NVIDIA Blackwell (B200/GB200), tuning ARM hosts, kernels, and FP4/TensorRT quantization for low latency and rapid model rollouts.

read full article on together.ai ↗

§ sources1 publication · timeline below

together.aiLearn how Cursor partnered with Together AI to deliver real-time, low-latency inference at scaleprimary01:00:00