Shipfeed. AI News Channel

10 latest items

▶ ai·08:31

Which Agent Causes Task Failures and When? Researchers from PSU and Duke explore automated failure attribution of LLM Multi-Agent Systems

In recent years, LLM Multi-Agent systems have garnered widespread attention for their collaborative approach to solving complex problems. However, it's a common scenario for these systems to fail at a task despite a…

Synced

▶ ai·11:17

ByteDance Introduces Astra: A Dual-Model Architecture for Autonomous Robot Navigation

ByteDance introduces Astra, a dual-model architecture for robot navigation in complex indoor environments.

Synced

▶ ai·14:58

MIT Researchers Unveil “SEAL”: A New Step Towards Self-Improving AI

MIT introduces SEAL, a framework enabling large language models to self-edit and update their weights via reinforcement learning.

Synced

▶ ai·09:39

Researchers from PSU and Duke introduce “Multi-Agent Systems Automated Failure Attribution”

"Automated failure attribution" is a crucial component in the development lifecycle of Multi-Agent systems. It has the potential to transform the challenge of identifying "what went wrong and who is to blame" from a…

Synced

▶ ai·11:31

Adobe Research: Unlocking Long-Term Memory in Video World Models with State-Space Models

By combining State-Space Models (SSMs) for efficient long-range dependency modeling with dense local attention for coherence, and using training strategies like diffusion forcing and frame local attention, researchers…

Synced

▶ deepseek·19:58

DeepSeek-V3 New Paper is coming! Unveiling the Secrets of Low-Cost Large Model Training through Hardware-Aware Co-design

A newly released 14-page technical paper from the team behind DeepSeek-V3, with DeepSeek CEO Wenfeng Liang as a co-author, sheds light on the “Scaling Challenges and Reflections on Hardware for AI Architectures.”…

Synced

▶ deepseek·17:46

DeepSeek Unveils DeepSeek-Prover-V2: Advancing Neural Theorem Proving with Recursive Proof Search and a New Benchmark

DeepSeek AI releases DeepSeek-Prover-V2, an open-source LLM for Lean 4 theorem proving. It uses recursive proof search with DeepSeek-V3 for training data and reinforcement learning, achieving top results on MiniF2F…

Synced

▶ ai·04:30

Can GRPO Be 10x More Efficient? Kwai AI’s SRPO Suggests Yes

Kwai AI's SRPO framework slashes LLM RL post-training steps by 90% while matching DeepSeek-R1 performance in math and code. This two-stage RL approach with history resampling overcomes GRPO limitations.

Synced

▶ ai·14:23

Zhipu.AI’s Open-Source Power Play: Blazing-Fast GLM Models & Global Expansion Ahead of Potential IPO

Zhipu.AI open-sources faster GLM models (8x speedup) and launches Z.ai, aiming for global expansion, potentially ahead of an IPO.

Synced

▶ deepseek·16:43

DeepSeek Signals Next-Gen R2 Model, Unveils Novel Approach to Scaling Inference with SPCT

DeepSeek AI, a prominent player in the large language model arena, has recently published a research paper detailing a new technique aimed at enhancing the scalability of general reward models (GRMs) during the…

Synced