§ feed · storyline
LightSeek Foundation releases TokenSpeed, open-source LLM inference
LightSeek Foundation releases TokenSpeed, an MIT-licensed open-source LLM inference engine in preview targeting agentic workloads with a KV-cache-safety scheduler.
MarkTechPost reports LightSeek Foundation’s MIT-licensed TokenSpeed inference engine (in preview) tailored for agentic workloads, claiming performance improvements over TensorRT-LLM on decode latency and throughput and describing a KV-cache-safety scheduler design.
§ sources1 publication · timeline below