§ local-llm · storyline

Ollama v0.30.0-rc31

Ollama v0.30.0-rc31 is a pre-release build that changes Ollama's architecture to support llama.cpp directly, adds GGUF compatibility, and uses MLX for accelerated inference on Apple Silicon.

May 13 · 16:32:54 · primary fetch · 1 source · updated May 13 · 16:32:54

This version of Ollama will change the architecture to directly support llama.cpp instead of building on top of GGML, and allows for compatibility with the GGUF file format. MLX is used to accelerate model inference on Apple Silicon.

While in pre-release we'd love feedback on:

- Performance improvements or degradation
- Errors or crashes that did not previously occur
- Memory utilization improvements or degradation

Known issues:

- `laguna-xs.2` is not yet supported on Windows/Linux.
- `llama3.2-vision` is not yet supported
- `nomic-embed-text` now converts inputs to lowercase per the model card where prior Ollama versions incorrectly preserved mixed case

Installing:

Mac/Linux

```
curl -fsSL https://ollama.com/install.sh | OLLAMA_VERSION=0.30.0-rc31 sh
```

Windows

```
$env:OLLAMA_VERSION="0.30.0-rc31"; irm https://ollama.com/install.ps1 | iex
```
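For the version and performance feedback requested above, a small shell sketch may help in comparing this pre-release against an earlier install. It is not part of the release notes. The helper names `is_expected_version` and `tokens_per_second` are hypothetical. The sketch assumes that `ollama --version` prints a line containing the version string, and that a non-streaming `/api/generate` response carries `eval_count` (tokens generated) and `eval_duration` (nanoseconds).

```shell
#!/bin/sh
# Hypothetical helpers for sanity-checking a pre-release install.

# Succeeds if the version output ($1) contains the expected version ($2).
# This is a plain substring match, so it does not depend on the exact wording.
is_expected_version() {
  case "$1" in
    *"$2"*) return 0 ;;
    *) return 1 ;;
  esac
}

# Prints generation throughput in tokens per second.
# $1 = eval_count (tokens generated), $2 = eval_duration in nanoseconds.
tokens_per_second() {
  awk -v n="$1" -v ns="$2" 'BEGIN {
    if (ns <= 0) { print "n/a"; exit }
    printf "%.2f\n", n / (ns / 1e9)
  }'
}

# Example usage (commented out: needs a running Ollama server and jq;
# <model> is a placeholder for a model you have already pulled):
#   is_expected_version "$(ollama --version)" "0.30.0-rc31" && echo "version ok"
#   resp=$(curl -s http://localhost:11434/api/generate \
#     -d '{"model":"<model>","prompt":"Hello","stream":false}')
#   tokens_per_second "$(echo "$resp" | jq .eval_count)" "$(echo "$resp" | jq .eval_duration)"
```

Running the same prompt against the previous release and this one, then comparing the two throughput figures, gives a rough basis for reporting a performance improvement or degradation. A single run is noisy, so several runs per version are advisable.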


§ sources · 1 publication · timeline below

github.com · Ollama v0.30.0-rc31 · primary · 16:32:54