shipfeedAI news, curated daily

02:03:31 CET
21 MAY02:03:31shipfeed
pull to refreshlast sync
Just in — 30 new
§ feed · storyline

Titans: Learning to Memorize at Test Time

Google publishes research on Titans, a neural memory architecture that integrates persistent memory into transformers at test time to improve long-context utilisation.

Jan 16 · · primary fetch1 sourceupdated Jan 16 ·

Google released a new paper on "Neural Memory" integrating persistent memory directly into transformer architectures at test time, showing promising long-context utilization. MiniMax-01 by @omarsar0 features a 4 million token context window with 456B parameters and 32 experts, outperforming GPT-4o and Claude-3.5-Sonnet. InternLM3-8B-Instruct is an open-source model trained on 4 trillion tokens with state-of-the-art results.

Transformer² introduces self-adaptive LLMs that dynamically adjust weights for continuous adaptation. Advances in AI security highlight the need for agent authentication, prompt injection defenses, and zero-trust architectures. Tools like Micro Diffusion enable budget-friendly diffusion model training, while LeagueGraph and Agent Recipes support open-source social media agents.

read full article on news.smol.ai
§ sources1 publication · timeline below
  1. news.smol.aiTitans: Learning to Memorize at Test Timeprimary