§ feed · storyline
How we monitor internal coding agents for misalignment
OpenAI publishes details of its chain-of-thought monitoring approach for detecting misalignment in internal coding agents deployed in real-world settings.
How OpenAI uses chain-of-thought monitoring to study misalignment in internal coding agents—analyzing real-world deployments to detect risks and strengthen AI safety safeguards.
§ sources1 publication · timeline below