Transformers v5.1.0
Hugging Face releases Transformers v5.1.0, adding support for four new models: EXAONE-MoE (from LG AI Research), PP-DocLayoutV3, Youtu-LLM, and GLM-OCR.
New model additions
EXAONE-MoE
K-EXAONE is a large-scale multilingual language model developed by LG AI Research. Built on a Mixture-of-Experts architecture, K-EXAONE has 236 billion total parameters, of which 23 billion are active during inference. Benchmark evaluations show that K-EXAONE performs strongly in reasoning, agentic capabilities, general knowledge, multilingual understanding, and long-context processing.
Add EXAONE-MoE implementations (#43080) by @nuxlear
PP-DocLayoutV3
PP-DocLayoutV3 is a unified, high-efficiency model designed for comprehensive layout analysis. It addresses the challenges of complex physical distortions (such as skewing, curving, and adverse lighting) by integrating instance segmentation and reading order prediction into a single, end-to-end framework.
[Model] Add PP-DocLayoutV3 Model Support (#43098) by @zhang-prog
Youtu-LLM
Youtu-LLM is a new, small yet powerful LLM: it has only 1.96B parameters, supports 128k long context, and has native agentic capabilities. On general evaluations, Youtu-LLM significantly outperforms SOTA LLMs of similar size in terms of Commonsense, STEM, Coding and Long Context capabilities; in…