§ feed · storyline

OpenAI and Anthropic share findings from a joint safety evaluation

OpenAI and Anthropic publish findings from a joint safety evaluation, a cross-lab collaboration in which each lab tested the other's models for misalignment, hallucinations, and jailbreaking.

Aug 27 · 12:00:00 · primary fetch · 1 source · updated Aug 27 · 12:00:00

OpenAI and Anthropic share findings from a first-of-its-kind joint safety evaluation, testing each other’s models for misalignment, instruction following, hallucinations, jailbreaking, and more—highlighting progress, challenges, and the value of cross-lab collaboration.

read full article on openai.com ↗

§ sources · 1 publication · timeline below

openai.com · OpenAI and Anthropic share findings from a joint safety evaluation · primary · 12:00:00