shipfeedAI news, curated daily

00:32:43 CET
21 MAY00:32:43shipfeed
pull to refreshlast sync
Just in — 30 new
§ feed · storyline

Block-sparse GPU kernels

OpenAI releases optimized GPU kernels for block-sparse neural networks, achieving faster performance than cuBLAS or cuSPARSE across text and image generative modelling tasks.

Dec 6 · · primary fetch1 sourceupdated Dec 6 ·

We’re releasing highly-optimized GPU kernels for an underexplored class of neural network architectures: networks with block-sparse weights. Depending on the chosen sparsity, these kernels can run orders of magnitude faster than cuBLAS or cuSPARSE.

We’ve used them to attain state-of-the-art results in text sentiment analysis and generative modeling of text and images.

read full article on openai.com
§ sources1 publication · timeline below
  1. openai.comBlock-sparse GPU kernelsprimary