shipfeed · AI news, curated daily

§ topic

local-llm

44 stories · 7d · 6 sources covering · 30 active storylines

Updated Fri, 19 Jun 2026 CEST · 44 new storylines this week · live

What this is

Local LLMs are open-weight models you can run on your own hardware. shipfeed tracks open-weight releases, quantization, and local inference tools like llama.cpp and Ollama.

storylines this week · 30 active

llama.cpp — Releases
LLAMA.CPP · b9496

Fixes Gemma 4 unified FPE on mtmd

mtmd: fix Gemma 4 unified FPE (#24088) macOS/iOS: macOS Apple Silicon (arm64) macOS Apple Silicon (arm64, KleidiAI enabled) DISABLED macOS Intel (x64) iOS XCFramework Linux: Ubuntu x64 (CPU) Ubuntu arm64 (CPU) Ubuntu…

via github.com
Wednesday, June 10, 2026’s edition
Monday, June 1, 2026’s edition
Friday, April 3, 2026’s edition
Smol AI — Daily
AI · 1 source

not much happened today

Gemma 4 was launched by Google under an Apache 2.0 license, marking a significant open-model release focused on reasoning, agentic workflows, multimodality, and on-device use. It outperforms models 10x larger and has…

via news.smol.ai
Wednesday, March 11, 2026’s edition
Smol AI — Daily
AI · 1 source

not much happened today

NVIDIA’s Nemotron 3 Super is a 120B parameter / ~12B active open model featuring a hybrid Mamba-Transformer / SSM Latent MoE architecture and 1M context window, delivering up to 2.2x faster inference than GPT-OSS-120B…

via news.smol.ai
Friday, June 19, 2026’s edition
Wednesday, June 3, 2026’s edition
Tuesday, June 2, 2026’s edition
llama.cpp — Releases
LLAMA.CPP · b9468

Adds real-time reasoning interruption via POST

server: real-time reasoning interruption via control endpoint (#23971). Builds on the manual reasoning budget trigger from #23949. Adds a CONTROL task that…

via github.com
Monday, June 1, 2026’s edition
Friday, May 29, 2026’s edition
Tuesday, May 26, 2026’s edition
Thursday, May 21, 2026’s edition
The Decoder
AI · 1 source

Cohere open-sources its strongest model yet

The Canadian AI company Cohere is releasing its most powerful language model to date, Command A+, as open source under an Apache 2.0 license.

via the-decoder.com
Saturday, May 16, 2026’s edition
Friday, May 15, 2026’s edition
Wednesday, May 13, 2026’s edition
Ollama — Releases
OLLAMA · 1 source

Ollama v0.30.0-rc17

This version of Ollama will change the architecture to directly support llama.cpp instead of building on top of GGML, and allows for compatibility with the GGUF file format. MLX is used to accelerate model inference on…

via github.com
Ollama — Releases
OLLAMA · 1 source

Ollama v0.30.0-rc27

This version of Ollama will change the architecture to directly support llama.cpp instead of building on top of GGML, and allows for compatibility with the GGUF file format. MLX is used to accelerate model inference on…

via github.com
Tuesday, May 5, 2026’s edition
transformers — Releases
AI · 1 source

Transformers v5.8.0

Release v5.8.0. New model additions: DeepSeek-V4. DeepSeek-V4 is the next-generation MoE (Mixture of Experts) language model from DeepSeek that introduces several architectural innovations over DeepSeek-V3. The…

via github.com
Monday, April 27, 2026’s edition
vLLM — Releases
AI · 1 source

vLLM v0.20.0

vLLM v0.20.0 highlights: This release features 752 commits from 320 contributors (123 new)! DeepSeek V4: Initial DeepSeek V4 support landed (#40860), with DSML token-leakage fix in DSV4/3.2 (#40806), DSA + MTP IMA fix…

via github.com
Wednesday, April 22, 2026’s edition
Smol AI — Daily
AI · 1 source

not much happened today

Alibaba released Qwen3.6-27B, a dense, Apache 2.0 open coding model with thinking and non-thinking modes, outperforming the larger Qwen3.5-397B-A17B on multiple coding benchmarks including SWE-bench and Terminal-Bench…

via news.smol.ai
Monday, April 20, 2026’s edition
Smol AI — Daily
AI · 1 source

not much happened today

Moonshot's Kimi K2.6 is a major open-weight 1T-parameter MoE model featuring 32B active parameters, 384 experts, MLA attention, 256K context window, native multimodality, and INT4 quantization. It supports day-0…

via news.smol.ai
Friday, April 3, 2026’s edition
vLLM — Releases
AI · 1 source

vLLM v0.19.0

vLLM v0.19.0 highlights: This release features 448 commits from 197 contributors (54 new)! Gemma 4 support: Full Google Gemma 4 architecture support including MoE, multimodal, reasoning, and tool-use capabilities…

via github.com
Thursday, April 2, 2026’s edition
Ollama — Releases
OLLAMA · 1 source

Ollama v0.20.0

Gemma 4: Effective 2B (E2B): ollama run gemma4:e2b · Effective 4B (E4B): ollama run gemma4:e4b · 26B (Mixture of Experts model with 4B active parameters): ollama run gemma4:26b · 31B (Dense): ollama…

via github.com
Wednesday, February 25, 2026’s edition
vLLM — Releases
AI · 1 source

vLLM v0.16.0

vLLM v0.16.0. Please note that this release was branch cut on Feb 8, so any features added to vLLM after that date are not included. Highlights: This release features 440 commits from 203 contributors (7 new)! Async…

via github.com
Tuesday, January 20, 2026’s edition
vLLM — Releases
AI · 1 source

vLLM v0.14.0

Highlights: This release features approximately 660 commits from 251 contributors (86 new contributors). Breaking changes: Async scheduling is now enabled by default; users who experience issues can disable it with…

via github.com
Friday, December 26, 2025’s edition
Smol AI — Daily
AGENTS · 1 source

not much happened today

MiniMax M2.1 launches as an open-source agent and coding Mixture-of-Experts (MoE) model with ~10B active / ~230B total parameters, claiming to outperform Gemini 3 Pro and Claude Sonnet 4.5, and supports local inference…

via news.smol.ai
Tuesday, December 23, 2025’s edition
Smol AI — Daily
AI · 1 source

not much happened today

GLM-4.7 and MiniMax M2.1 open-weight model releases highlight day-0 ecosystem support, coding throughput, and agent workflows, with GLM-4.7 achieving a +9.5% improvement over GLM-4.6 and MiniMax M2.1 positioned as an…

via news.smol.ai