§ local-llm · storyline
Ollama v0.20.4
Ollama releases v0.20.4 with improved M5 performance via NAX on MLX and flash attention enabled for Gemma 4 models.
What's Changed
- mlx: Improve M5 performance with NAX
- gemma4: enable flash attention
Full Changelog: https://github.com/ollama/ollama/compare/v0.20.3...v0.20.4
§ sources · 1 publication · timeline below
- github.com · ollama v0.20.4 · primary