shipfeed: AI news, curated daily

§ feed · storyline

whisper.cpp v1.7.3-pre

whisper.cpp v1.7.3-pre ships as a pre-release with Metal backend performance gains, most notably for beam search (beams > 1) and quantized models, alongside a migration of the build system to CMake.

Dec 9 · primary fetch · 1 source · updated Dec 9

Overview

Massive performance improvements for the Metal backend, especially for beams > 1 and for quantized models.

Marked as "pre-release" because there have been major changes to the build system (now using CMake), and I want to gather feedback on how well the project builds on various platforms. Please leave comments in the discussion to help fix any remaining issues before the official release.

| CPU | Config | Model | Th | FA | Enc. | Dec. | Bch5 | PP | Commit |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| M2 Ultra | Metal | tiny | 1 | 1 | 7.90 | 1.26 | 0.35 | 0.01 | ed733e8 |
| M2 Ultra | Metal | tiny-q5_0 | 1 | 1 | 8.44 | 1.23 | 0.36 | 0.01 | ed733e8 |
| M2 Ultra | Metal | tiny-q5_1 | 1 | 1 | 8.26 | 1.27 | 0.37 | 0.01 | ed733e8 |
| M2 Ultra | Metal | tiny-q8_0 | 1 | 1 | 8.03 | 1.21 | 0.35 | 0.01 | ed733e8 |
| M2 Ultra | Metal | base | 1 | 1 | 13.77 | 1.80 | 0.42 | 0.02 | ed733e8 |
| M2 Ultra | Metal | base-q5_0 | 1 | 1 | 15.02 | 1.72 | 0.42 | 0.02 | ed733e8 |
| M2 Ultra | Metal | base-q5_1 | 1 | 1 | 14.93 | 1.74 | 0.42 | 0.02 | ed733e8 |
| M2 Ultra | Metal | base-q8_0 | 1 | 1 | 14.26 | 1.68 | 0.41 | 0.02 | ed733e8 |
| M2 Ultra | Metal…
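To compare the quantized variants against their unquantized counterparts, the rows above can be parsed and expressed as ratios. The following is a minimal sketch, not part of the release: the helper names `parse_rows` and `relative_to_baseline` and the lowercase column keys are hypothetical, and since the excerpt does not define the columns or their units, the code only computes ratios of the listed figures.

```python
# Rows copied verbatim from the benchmark table above (complete rows only).
ROWS = """\
| M2 Ultra | Metal | tiny | 1 | 1 | 7.90 | 1.26 | 0.35 | 0.01 | ed733e8 |
| M2 Ultra | Metal | tiny-q5_0 | 1 | 1 | 8.44 | 1.23 | 0.36 | 0.01 | ed733e8 |
| M2 Ultra | Metal | tiny-q5_1 | 1 | 1 | 8.26 | 1.27 | 0.37 | 0.01 | ed733e8 |
| M2 Ultra | Metal | tiny-q8_0 | 1 | 1 | 8.03 | 1.21 | 0.35 | 0.01 | ed733e8 |
| M2 Ultra | Metal | base | 1 | 1 | 13.77 | 1.80 | 0.42 | 0.02 | ed733e8 |
| M2 Ultra | Metal | base-q5_0 | 1 | 1 | 15.02 | 1.72 | 0.42 | 0.02 | ed733e8 |
| M2 Ultra | Metal | base-q5_1 | 1 | 1 | 14.93 | 1.74 | 0.42 | 0.02 | ed733e8 |
| M2 Ultra | Metal | base-q8_0 | 1 | 1 | 14.26 | 1.68 | 0.41 | 0.02 | ed733e8 |
"""

# Hypothetical keys, in the same order as the table header.
COLUMNS = ["cpu", "config", "model", "th", "fa", "enc", "dec", "bch5", "pp", "commit"]
NUMERIC = ("enc", "dec", "bch5", "pp")


def parse_rows(text):
    """Turn pipe-delimited table rows into a list of dicts."""
    rows = []
    for line in text.strip().splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        if len(cells) != len(COLUMNS):
            continue  # skip truncated or malformed rows
        row = dict(zip(COLUMNS, cells))
        for key in NUMERIC:
            row[key] = float(row[key])
        rows.append(row)
    return rows


def relative_to_baseline(rows, metric):
    """Ratio of each quantized model's figure to its unquantized family member."""
    by_model = {r["model"]: r[metric] for r in rows}
    ratios = {}
    for model, value in by_model.items():
        family = model.split("-")[0]  # "tiny-q5_0" -> "tiny"
        if model != family and family in by_model:
            ratios[model] = value / by_model[family]
    return ratios


rows = parse_rows(ROWS)
enc_ratios = relative_to_baseline(rows, "enc")
dec_ratios = relative_to_baseline(rows, "dec")
for model in enc_ratios:
    print(f"{model:10s} enc x{enc_ratios[model]:.3f}  dec x{dec_ratios[model]:.3f}")
```

A ratio above 1 means the quantized variant's figure in that column is larger than the unquantized model's. For instance, tiny-q5_0 lists 8.44 under Enc. against 7.90 for tiny, while base-q8_0 lists 1.68 under Dec. against 1.80 for base.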

§ sources · 1 publication · timeline below
  1. github.com · whisper.cpp v1.7.3-pre · primary