§ feed · storyline

Titans: Learning to Memorize at Test Time

Google publishes research on Titans, a neural memory architecture that integrates persistent memory into transformers at test time to improve long-context utilization.

Jan 16 · 08:58:41 · primary fetch · 1 source · updated Jan 16 · 08:58:41

Google released a new paper on "Neural Memory" that integrates persistent memory directly into transformer architectures at test time, showing promising long-context utilization. MiniMax-01, highlighted by @omarsar0, features a 4 million token context window with 456B parameters and 32 experts, and reportedly outperforms GPT-4o and Claude-3.5-Sonnet. InternLM3-8B-Instruct is an open-source model trained on 4 trillion tokens that reports state-of-the-art results.

Transformer² introduces self-adaptive LLMs that dynamically adjust weights for continuous adaptation. Advances in AI security highlight the need for agent authentication, prompt injection defenses, and zero-trust architectures. Tools like Micro Diffusion enable budget-friendly diffusion model training, while LeagueGraph and Agent Recipes support open-source social media agents.

read full article on news.smol.ai ↗

§ sources · 1 publication · timeline below

news.smol.ai · Titans: Learning to Memorize at Test Time · primary · 08:58:41