§ feed · storyline
Beyond Standard LLMs
Beyond Standard LLMs
Linear Attention Hybrids, Text Diffusion, Code World Models, and Small Recursive Transformers
§ sources1 publication · timeline below
- magazine.sebastianraschka.comBeyond Standard LLMsprimary