§ feed · storyline
Evaluating AI’s ability to perform scientific research tasks
OpenAI introduces FrontierScience, a benchmark evaluating AI reasoning across physics, chemistry, and biology to measure progress toward autonomous scientific research.
§ sources · 1 publication · timeline below