§ feed · storyline
Introducing HealthBench
OpenAI releases HealthBench, a physician-informed evaluation benchmark designed to assess AI model performance and safety across realistic healthcare scenarios.
HealthBench is a new evaluation benchmark for AI in healthcare which evaluates models in realistic scenarios. Built with input from 250+ physicians, it aims to provide a shared standard for model performance and safety in health.
§ sources1 publication · timeline below
- openai.comIntroducing HealthBenchprimary