§ safety · storyline
A shared playbook for trustworthy third party evaluations
OpenAI publishes guidance on conducting trustworthy third-party evaluations of frontier AI models, covering capability assessment, safeguards, and validity.
OpenAI shares guidance on third-party AI evaluations, covering how to assess model capabilities, safeguards, and validity for frontier systems.
§ sources1 publication · timeline below