shipfeed · AI news, curated daily

§ tools · storyline

Transformers v5.7.0

Hugging Face releases Transformers v5.7.0, adding Poolside's Laguna mixture-of-experts model, the DEIMv2 real-time object detection model, and several attention bug fixes.

Apr 28 · primary fetch · 1 source · updated Apr 28

Release v5.7.0

New Model additions

Laguna

Laguna is Poolside's mixture-of-experts language model family that extends standard SwiGLU MoE transformers with two key innovations. It features per-layer head counts, allowing different decoder layers to have different query-head counts while sharing the same KV cache shape, and it implements a sigmoid MoE router with auxiliary-loss-free load balancing that uses the element-wise sigmoid of the gate logits plus a learned per-expert bias for router scoring.

Links: Documentation

Laguna XS.2 implementation (#45673) by @joerowell in #45673

DEIMv2

DEIMv2 (DETR with Improved Matching v2) is a real-time object detection model that extends DEIM with DINOv3 features and spans eight model sizes, from X to Atto, for diverse deployment scenarios. It uses a Spatial Tuning Adapter (STA) for larger variants to convert DINOv3's single-scale output into multi-scale features, while ultra-lightweight models employ pruned HGNetv2 backbones. The unified design achieves superior performance-cost trade-offs, with DEIMv2-X reaching 57.8 AP with only 50.3M parameters and DEIMv2-S being the first sub-10M model to exceed 50 AP on COCO.

Links: Documentation | Paper

model: Add…
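
The router scoring described for Laguna (element-wise sigmoid of the gate logits plus a per-expert bias) can be illustrated with a small sketch. This is not the Transformers implementation: the function name `route_token`, the top-k selection, and the choice to normalize the unbiased sigmoid scores into mixing weights are assumptions borrowed from common auxiliary-loss-free router designs, and the release notes do not confirm how Laguna handles those details.

```python
import math


def sigmoid(x: float) -> float:
    """Element-wise logistic function applied to one gate logit."""
    return 1.0 / (1.0 + math.exp(-x))


def route_token(gate_logits, expert_bias, top_k):
    """Pick top_k experts for a single token (hypothetical helper).

    Selection uses sigmoid(logit) + bias, as the release notes describe.
    ASSUMPTION: mixing weights come from the unbiased sigmoid scores,
    renormalized over the chosen experts; Laguna may do this differently.
    """
    scores = [sigmoid(z) for z in gate_logits]
    biased = [s + b for s, b in zip(scores, expert_bias)]

    # Rank experts by biased score and keep the best top_k indices.
    ranked = sorted(range(len(biased)), key=lambda i: biased[i], reverse=True)
    chosen = ranked[:top_k]

    # Normalize the unbiased scores of the chosen experts so they sum to 1.
    total = sum(scores[i] for i in chosen)
    weights = [scores[i] / total for i in chosen]
    return chosen, weights


if __name__ == "__main__":
    logits = [2.0, 1.0, 0.0, -1.0]

    # With zero bias, the two largest logits win.
    print(route_token(logits, [0.0, 0.0, 0.0, 0.0], top_k=2))

    # A negative bias on expert 0 pushes the token to other experts,
    # with no auxiliary loss term involved.
    print(route_token(logits, [-0.5, 0.0, 0.0, 0.0], top_k=2))
```

In auxiliary-loss-free balancing schemes generally, the per-expert bias is what steers load: lowering it for a busy expert makes that expert less likely to be selected, without adding a balancing term to the training loss. Because each sigmoid score is computed independently, experts do not compete through a softmax before the bias is applied.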

read full article on github.com
§ sources · 1 publication · timeline below
  1. github.com · transformers v5.7.0 · primary