§ feed · storyline

Faulty reward functions in the wild

Dec 21 · 09:00:00 · primary fetch1 sourceupdated Dec 21 · 09:00:00

Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we’ll explore one failure mode, which is where you misspecify your reward function.

read full article on openai.com ↗

§ sources1 publication · timeline below

openai.comFaulty reward functions in the wildprimary09:00:00