Quantifying generalization in reinforcement learning
OpenAI releases CoinRun, a training environment designed to measure how well reinforcement learning agents generalise to novel situations beyond their training conditions.
We’re releasing CoinRun, a training environment which provides a metric for an agent’s ability to transfer its experience to novel situations and has already helped clarify a longstanding puzzle in reinforcement learning.
CoinRun strikes a desirable balance in complexity: the environment is simpler than traditional platformer games like Sonic the Hedgehog but still poses a worthy generalization challenge for state of the art algorithms.