shipfeedAI news, curated daily

23:53:02 CET
20 MAY23:53:02shipfeed
pull to refreshlast sync
Just in — 30 new
§ feed · storyline

Transformers v5.5.2

Hugging Face releases Transformers v5.5.2, a patch fixing Gemma4 inference with use_cache=False, MoE tensor-parallel support, shared-weight handling, and VLM conversion mapping bugs.

Apr 9 · · primary fetch1 sourceupdated Apr 9 ·

Small patch dedicated to optimizing gemma4, fixing inference with `use_cache=False` due to k/v states sharing between layers, as well as conversion mappings for some models that would inconsistently serialize their weight names. It contains the following PRs: Add MoE to Gemma4 TP plan (#45219) by @sywangyi and @Cyrilvallez [gemma4] Dissociate kv states sharing from the Cache (#45312) by @Cyrilvallez [gemma4] Remove all shared weights, and silently skip them during loading (#45336) by @Cyrilvallez Fix conversion mappings for vlms (#45340) by @Cyrilvallez

read full article on github.com
§ sources1 publication · timeline below
  1. github.comtransformers v5.5.2 — Patch release: v5.5.2primary