shipfeedAI news, curated daily

01:27:27 CET
21 MAY01:27:27shipfeed
pull to refreshlast sync
Just in — 30 new
§ feed · storyline

Deep Neural Nets: 33 years ago and 33 years from now

Andrej Karpathy reproduces LeCun's 1989 backpropagation paper in PyTorch, using it as a case study on 33 years of progress in deep learning.

Mar 14 · · primary fetch1 sourceupdated Mar 14 ·

.post-header h1 { font-size: 35px; } .post pre, .post code { background-color: #fcfcfc; font-size: 13px; / make code smaller for this post... / } The Yann LeCun et al. (1989) paper Backpropagation Applied to Handwritten Zip Code Recognition is I believe of some historical significance because it is, to my knowledge, the earliest real-world application of a neural net trained end-to-end with backpropagation. Except for the tiny dataset (7291 16x16 grayscale images of digits) and the tiny neural network used (only 1,000 neurons), this paper reads remarkably modern today, 33 years later - it lays out a dataset, describes the neural net architecture, loss function, optimization, and reports the experimental classification error rates over training and test sets.

It’s all very recognizable and type checks as a modern deep learning paper, except it is from 33 years ago. So I set out to reproduce the paper 1) for fun, but 2) to use the exercise as a case study on the nature of progress in deep learning. Implementation. I tried to follow the paper as close as possible and re-implemented everything in PyTorch in this karpathy/lecun1989-repro github repo. The original network was…

read full article on karpathy.github.io
§ sources1 publication · timeline below
  1. karpathy.github.ioDeep Neural Nets: 33 years ago and 33 years from nowprimary