shipfeed · AI news, curated daily

Gathering human feedback

RL-Teacher is released as an open-source tool for training AI agents with periodic human feedback in place of manually designed reward functions.

Aug 3 · primary fetch · 1 source · updated Aug 3

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with rewards that are hard to specify.
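To make the idea of learning from occasional human feedback concrete, here is a minimal sketch, not RL-Teacher's actual code or API. It assumes the feedback arrives as pairwise comparisons between two short behavior segments, and it fits a linear reward model to those comparisons with a logistic (Bradley-Terry style) loss. The function names, the two-feature toy data, and the simulated "human" who always prefers a higher total of feature 0 are all hypothetical choices made for illustration.

```python
import math
import random


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def feature_totals(segment, n_features):
    """Sum each feature over the steps of a segment (a list of feature vectors)."""
    return [sum(step[i] for step in segment) for i in range(n_features)]


def train_reward_model(comparisons, n_features, lr=0.1, epochs=200):
    """Fit linear reward weights from (segment_a, segment_b, label) triples.

    label is 1.0 if the (simulated) human preferred segment_a, else 0.0.
    """
    weights = [0.0] * n_features
    for _ in range(epochs):
        for seg_a, seg_b, label in comparisons:
            fa = feature_totals(seg_a, n_features)
            fb = feature_totals(seg_b, n_features)
            # Predicted return difference under the current reward model.
            diff = sum(w * (a - b) for w, a, b in zip(weights, fa, fb))
            p_a = sigmoid(diff)  # modelled probability that a is preferred
            # Gradient step on the cross-entropy between p_a and the label.
            for i in range(n_features):
                weights[i] -= lr * (p_a - label) * (fa[i] - fb[i])
    return weights


def prefers(weights, seg_a, seg_b):
    """True if the learned reward model scores seg_a above seg_b."""
    n = len(weights)
    fa = feature_totals(seg_a, n)
    fb = feature_totals(seg_b, n)
    return sum(w * (a - b) for w, a, b in zip(weights, fa, fb)) > 0


# Toy data: the hidden "true" reward is feature 0; feature 1 is a distractor.
rng = random.Random(0)


def random_segment(steps=5):
    return [[rng.uniform(-1, 1), rng.uniform(-1, 1)] for _ in range(steps)]


comparisons = []
for _ in range(50):
    a, b = random_segment(), random_segment()
    # Simulated human feedback: prefer the segment with more of feature 0.
    label = 1.0 if sum(s[0] for s in a) > sum(s[0] for s in b) else 0.0
    comparisons.append((a, b, label))

weights = train_reward_model(comparisons, n_features=2)
print("learned reward weights:", weights)
```

In a full system the learned reward would then stand in for the hand-crafted reward function when training the agent with an ordinary reinforcement learning algorithm; that step is omitted here.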

read full article on openai.com
§ sources · 1 publication · timeline below
  1. openai.com · Gathering human feedback · primary