§ feed · storyline
AI safety via debate
OpenAI proposes a safety technique called debate, in which AI agents argue opposing positions and a human judge determines the winner to guide training.
We’re proposing an AI safety technique which trains agents to debate topics with one another, using a human to judge who wins.
§ sources1 publication · timeline below
- openai.comAI safety via debateprimary