§ feed · storyline
Reasoning models struggle to control their chains of thought, and that’s good
OpenAI introduces CoT-Control, finding that reasoning models cannot reliably steer their own chains of thought, which the lab says supports monitorability as an AI safety measure.
OpenAI introduces CoT-Control and finds reasoning models struggle to control their chains of thought, reinforcing monitorability as an AI safety safeguard.
§ sources1 publication · timeline below