shipfeed · AI news, curated daily


Sakana AI and NVIDIA introduce TwELL for faster LLM inference

Sakana AI and NVIDIA introduce TwELL, a sparse data format with custom CUDA kernels delivering up to 21.9% training and 20.5% inference speedups in LLM feedforward layers.

May 11 · primary fetch · 1 source · updated May 11

TwELL is a sparse data format with custom CUDA kernels that targets the feedforward layers of LLMs, achieving speedups of up to 20.5% in inference and 21.9% in training.

read full article on marktechpost.com
§ sources · 1 publication · timeline below
  1. marktechpost.com · Sakana AI and NVIDIA Introduce TwELL with CUDA Kernels for 20.5% Inference and 21.9% Training Speedup in LLMs · primary