§ feed · storyline
Nous Research speeds up LLM pre-training by up to 2.5x with Token Superposition Training
Nous Research released Token Superposition Training (TST), accelerating LLM pre-training by up to 2.5x across models from 270M to 10B parameters without changing architecture or data.
§ sources · 1 publication · timeline below