§ tools · storyline

llama.cpp b9151

llama.cpp releases build b9151 with logging reductions, server log clean-ups, prompt processing timings, and sampling parameter output across major platforms.

May 14 · 19:22:45 · primary fetch1 sourceupdated May 14 · 19:22:45

logs : reduce (#23021) logs : reduce args : fix envs server : fix build common : print verbosity level at start server : clean-up logs server : print prompt processing timings + sampling params minor : whitespaces macOS/iOS: macOS Apple Silicon (arm64) macOS Apple Silicon (arm64, KleidiAI enabled) macOS Intel (x64) iOS XCFramework Linux: Ubuntu x64 (CPU) Ubuntu arm64 (CPU) Ubuntu s390x (CPU) Ubuntu x64 (Vulkan) Ubuntu arm64 (Vulkan) Ubuntu x64 (ROCm 7.2) Ubuntu x64 (OpenVINO) Ubuntu x64 (SYCL FP32) Ubuntu x64 (SYCL FP16) Android: Android arm64 (CPU) Windows: Windows x64 (CPU) Windows arm64 (CPU) Windows x64 (CUDA 12) - CUDA 12.4 DLLs Windows x64 (CUDA 13) - CUDA 13.1 DLLs Windows x64 (Vulkan) Windows x64 (SYCL) Windows x64 (HIP) openEuler: openEuler x86 (310p) openEuler x86 (910b, ACL Graph) openEuler aarch64 (310p) openEuler aarch64 (910b, ACL Graph)

read full article on github.com ↗

§ sources3 publications · timeline below

github.comllama.cpp b9151primary19:22:45
github.comllama.cpp b915017:46:23
github.comllama.cpp b914817:11:36

§ how this story moved

17:11:36primary — llama.cpp — Releases publishes the launch post.
17:46:23llama.cpp — Releases picks up coverage.
19:22:45llama.cpp — Releases picks up coverage.