shipfeedAI news, curated daily

01:16:28 CET
21 MAY01:16:28shipfeed
pull to refreshlast sync
Just in — 30 new
§ tools · storyline

Transformers v5.5.4

Hugging Face releases Transformers v5.5.4, patching the Kimi-K2.5 tokenizer regression, a DeepSpeed ZeRO-3 IndexError, and a Qwen2.5-VL temporal RoPE scaling bug affecting still images.

Apr 13 · · primary fetch1 sourceupdated Apr 13 ·

Patch release v5.5.4 This is mostly some fixes that are good to have asap, mostly for tokenizers; ** Fix Kimi-K2.5 tokenizer regression and _patch_mistral_regex Attribute… (#45305) by ArthurZucker For training: ** Fix #45305 + add regression test GAS (#45349) by florian6973, SunMarc ** Fix IndexError with DeepSpeed ZeRO-3 when kernels rotary is active (#…) by ArthurZucker And for Qwen2.5-VL : ** Fix Qwen2.5-VL temporal RoPE scaling applied to still images (#45330) by Kash6, zucchini-nlp

read full article on github.com
§ sources1 publication · timeline below
  1. github.comtransformers v5.5.4primary