shipfeedAI news, curated daily

01:22:19 CET
21 MAY01:22:19shipfeed
pull to refreshlast sync
Just in — 30 new
§ feed · storyline

Moondream 2025.1.9: Structured Text, Enhanced OCR, Gaze Detection in a 2B Model

Moondream releases version 2025.1.9 of its 2B vision model, adding structured text output, enhanced OCR, and gaze detection alongside improved VRAM efficiency.

Jan 11 · · primary fetch1 sourceupdated Jan 11 ·

Moondream has released a new version that advances VRAM efficiency and adds structured output and gaze detection, marking a new frontier in vision model practicality. Discussions on Twitter highlighted advancements in reasoning models like OpenAI's o1, model distillation techniques, and new multimodal embedding models such as vdr-2b-multi-v1 and LLaVA-Mini, which significantly reduce computational costs.

Research on GANs and decentralized diffusion models showed improved stability and performance. Development tools like MLX and vLLM received updates for better portability and developer experience, while frameworks like LangChain and Qdrant enable intelligent data workflows. Company updates include new roles and team expansions at GenmoAI. "Efficiency tricks are all you need."

read full article on news.smol.ai
§ sources1 publication · timeline below
  1. news.smol.aiMoondream 2025.1.9: Structured Text, Enhanced OCR, Gaze Detection in a 2B Modelprimary