whisper.cpp v1.8.4
Overview Maintenance release, latest ggml, some performance gains across the board. What's Changed Fix: UTF-8 character cut off to two "�" in segment wrapping (max_len) by @lordofriver in…
https://github.com/ggerganov/whisper.cpp·tool·20 items·last fetched
Overview Maintenance release, latest ggml, some performance gains across the board. What's Changed Fix: UTF-8 character cut off to two "�" in segment wrapping (max_len) by @lordofriver in…
Overview Maintenance release, latest ggml, minor improvements in the tools/server/bindings. What's Changed server : set no_context == true by @ggerganov in https://github.com/ggml-org/whisper.cpp/pull/3482 WASM example…
Overview Fix a bug in the ggml norm CPU scalar operator What's Changed sync : ggml by @ggerganov in https://github.com/ggml-org/whisper.cpp/pull/3478 Full Changelog…
Overview Fix Vulkan builds Fix memory leaks when using VAD Support `--carry-initial-prompt` What's Changed bindings-java : disable flash attention by default by @danbev in…
Overview Flash attention is now enabled by default Performance improvements M1 Pro | CPU | Config | Model | Th | FA | Enc. | Dec. | Bch5 | PP | Commit | | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | |…
Overview Add initial VAD support - feedback welcome and appreciated Metal FA improvements M2 Ultra Flash Attention ON: | CPU | Config | Model | Th | FA | Enc. | Dec. | Bch5 | PP | Commit | | --- | --- | --- | --- |…
Overview This is a relatively big update with various build and CI improvements especially for iOS and WASM. There are also some performance gains, especially for the Metal backend and probably for Arm-based devices…
Overview Minor release with mostly build fixes. What's Changed whisper : rename binaries + fix install by @ggerganov in https://github.com/ggerganov/whisper.cpp/pull/2648 feat(server): Add option to suppress non-speech…
Overview Massive performance improvements for the Metal backend, especially for beams > 1 and for quantized models Reduce hallucinations during silence by @jkarthic in https://github.com/ggerganov/whisper.cpp/pull/2629…
Overview Massive performance improvements for the Metal backend, especially for beams > 1. Especially for quantized models. Setting as "pre-release" since there have been major changes to the build system (now using…
Overview Various improvements in the Metal backend Fix extra memory usage for large samples Remove limit for `ggml_context` (i.e. more beams and processors are supported) | CPU | Config | Model | Th | FA | Enc. | Dec…
Overview This is a pre-release since I think there have been some reports about memory leaks which I haven't had the time to investigate and confirm. If these are resolved in the next days, will add them to the…
Overview Fix Vulkan crashes Performance stats for Vulkan on RTX 2060 | GPU | Config | Model | Th | FA | Enc. | Dec. | Bch5 | PP | Commit | | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | | RTX 2060 |…
Overview Fix crashes with high number of beams Reduce overal VRAM usage Optimize Encoder performance Some performance numbers for this release: M2 Ultra Flash Attention ON: | GPU | Config | Model | Th | FA | Enc. |…
Overview Bugfix when using multiple `whisper_state` in parallel: https://github.com/ggerganov/whisper.cpp/pull/2182 What's Changed Update ruby bindings by @taf2 in https://github.com/ggerganov/whisper.cpp/pull/2154…
Minor release adding initial ffmpeg support in the examples https://github.com/ggerganov/whisper.cpp/pull/2133 (thx @WilliamTambellini) What's Changed ci: Update build.yml to suppress warnings about node.js versions by…
Overview Can optionally enable Flash Attention for faster processing on CUDA and Metal devices (#2152) Faster ppc64 performance (40aeeeecc4b8700b2a7e50cbcfa5c5412f2626ab) (not tested) Fix `main` slowdown bug (#2070)…
Overview Many small incremental updates + Token level timestamps with DTW by @denersc in https://github.com/ggerganov/whisper.cpp/pull/1485 Feedback is welcome! Full Changelog…
Overview Faster Core ML ANE models (#1716) CUDA bugfix causing random erros in the transcription Fix SwiftUI example build Full Changelog: https://github.com/ggerganov/whisper.cpp/compare/v1.5.3...v1.5.4