shipfeed: AI news, curated daily


Introducing HealthBench

OpenAI releases HealthBench, a physician-informed evaluation benchmark designed to assess AI model performance and safety across realistic healthcare scenarios.

May 12 · 1 source · updated May 12

HealthBench is a new evaluation benchmark for AI in healthcare that tests models in realistic scenarios. Built with input from 250+ physicians, it aims to provide a shared standard for model performance and safety in health.

read full article on openai.com
§ sources · 1 publication
  1. openai.com: Introducing HealthBench (primary)