§ feed · storyline

Our approach to alignment research

Anthropic outlines its alignment research approach, focusing on learning from human feedback and building AI systems capable of assisting humans in evaluating and solving further alignment problems.

Aug 24 · 09:00:00 · primary fetch1 sourceupdated Aug 24 · 09:00:00

We are improving our AI systems’ ability to learn from human feedback and to assist humans at evaluating AI.

Our goal is to build a sufficiently aligned AI system that can help us solve all other alignment problems.

read full article on openai.com ↗

§ sources1 publication · timeline below

openai.comOur approach to alignment researchprimary09:00:00