shipfeedAI news, curated daily

23:52:19 CET
20 MAY23:52:19shipfeed
pull to refreshlast sync
Just in — 30 new
§ feed · storyline

DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing

UC Berkeley's EPIC Lab releases DocETL, an agentic query rewriting and evaluation system for complex document processing using compound LLM-based data operators.

Oct 22 · · primary fetch1 sourceupdated Oct 22 ·

UC Berkeley's EPIC lab introduces innovative LLM data operators with projects like LOTUS and DocETL, focusing on effective programming and computation over large data corpora. This approach contrasts GPU-rich big labs like Deepmind and OpenAI with GPU-poor compound AI systems. Microsoft open-sourced BitNet b1.58, a 1-bit ternary parameter LLM enabling 4-20x faster training and on-device inference at human reading speeds.

Nvidia released Llama-3.1-Nemotron-70B-Instruct, a fine-tuned open-source model outperforming GPT-4o and Claude-3.5-sonnet. These developments highlight advances in model-optimization, on-device-ai, and fine-tuning.

read full article on news.smol.ai
§ sources1 publication · timeline below
  1. news.smol.aiDocETL: Agentic Query Rewriting and Evaluation for Complex Document Processingprimary