shipfeed · AI news, curated daily

Faulty reward functions in the wild

Dec 21 · primary fetch · 1 source · updated Dec 21

Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we’ll explore one failure mode: a misspecified reward function.

read full article on openai.com
§ sources · 1 publication · timeline below
  1. openai.com · Faulty reward functions in the wild (primary)