shipfeedAI news, curated daily

23:05:46 CET
20 MAY23:05:46shipfeed
pull to refreshlast sync
Just in — 30 new
§ tools · storyline

Transformers v5.6.0

Hugging Face releases Transformers v5.6.0, adding OpenAI Privacy Filter for PII detection, Baidu's QianfanOCR 4B document intelligence model, and SAM3-LiteText.

Apr 22 · · primary fetch1 sourceupdated Apr 22 ·

Release v5.6.0 New Model additions OpenAI Privacy Filter OpenAI Privacy Filter is a bidirectional token-classification model for personally identifiable information (PII) detection and masking in text. It is intended for high-throughput data sanitization workflows where teams need a model that they can run on-premises that is fast, context-aware, and tunable. The model labels an input sequence in a single forward pass, then decodes coherent spans with a constrained Viterbi procedure, predicting probability distributions over 8 privacy-related output categories for each input token. Links: Documentation [`Privacy Filter`] Add model (#45580) by @vasqu in #45580 QianfanOCR Qianfan-OCR is a 4B-parameter end-to-end document intelligence model developed by Baidu that performs direct image-to-text conversion without traditional multi-stage OCR pipelines.

It supports a broad range of prompt-driven tasks including structured document parsing, table extraction, chart understanding, document question answering, and key information extraction all within one unified model. The model features a unique "Layout-as-Thought" capability that generates structured layout representations before…

read full article on github.com
§ sources1 publication · timeline below
  1. github.comtransformers v5.6.0primary