shipfeed · AI news, curated daily

§ tools · storyline

Ollama v0.23.1

Ollama v0.23.1 adds Gemma 4 MTP speculative decoding for the MLX runner on Macs, enabling over 2x speed gains on the Gemma 4 31B coding model.

May 5 · primary fetch · 1 source · updated May 5

Gemma 4 MTP (Multi-token Processing) for the MLX runner

Gemma 4 MTP speculative decoding is now supported on Macs. This can give over a 2x speed increase for the Gemma 4 31B model on coding tasks.

```
ollama run gemma4:31b-coding-mtp-bf16
```

What's Changed

- Update MLX and MLX-C with threading fixes by @dhiltgen in https://github.com/ollama/ollama/pull/15845
- go: bump to 1.26 by @ParthSareen in https://github.com/ollama/ollama/pull/15904
- Add Gemma 4 MTP speculative decoding by @pdevine in https://github.com/ollama/ollama/pull/15980

Full Changelog: https://github.com/ollama/ollama/compare/v0.23.0...v0.23.1
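One way to check the "over 2x" figure on your own Mac is to compare decode throughput between the MTP model and a non-MTP baseline through Ollama's local HTTP API. The sketch below is a minimal, unofficial example. It assumes the server is listening on the default `http://localhost:11434`, and that the `/api/generate` response carries the usual `eval_count` and `eval_duration` (nanoseconds) fields when served by the MLX runner. The helper names (`tokens_per_second`, `speedup`, `generate`, `compare`) are illustrative, not part of Ollama.

```python
import json
import urllib.request

# Assumption: default local Ollama endpoint; adjust if your server runs elsewhere.
OLLAMA_URL = "http://localhost:11434/api/generate"


def tokens_per_second(result: dict) -> float:
    """Decode throughput from a non-streaming /api/generate response.

    eval_count is the number of generated tokens and eval_duration is the
    generation time in nanoseconds.
    """
    return result["eval_count"] / (result["eval_duration"] / 1e9)


def speedup(baseline: dict, candidate: dict) -> float:
    """Ratio of candidate throughput to baseline throughput."""
    return tokens_per_second(candidate) / tokens_per_second(baseline)


def generate(model: str, prompt: str) -> dict:
    """Send one non-streaming generation request and return the parsed JSON."""
    body = json.dumps({"model": model, "prompt": prompt, "stream": False}).encode()
    req = urllib.request.Request(
        OLLAMA_URL, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def compare(baseline_model: str, mtp_model: str, prompt: str) -> float:
    """Run the same prompt on both models and return the measured speedup."""
    return speedup(generate(baseline_model, prompt), generate(mtp_model, prompt))
```

For example, `compare(baseline_tag, "gemma4:31b-coding-mtp-bf16", "Write a function that reverses a linked list.")` returns the throughput ratio, where `baseline_tag` is whichever non-MTP Gemma 4 31B tag you have pulled (the release note does not name one). A single run is noisy, so averaging over several coding prompts gives a fairer comparison.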

§ sources · 1 publication · timeline below
  1. github.com · ollama v0.23.1 · primary