OpenAI o1 Model Outperforms Doctors on Clinical Reasoning Tasks, Study Finds

A new study published in Science found that OpenAI's o1 reasoning model surpassed human physicians in diagnostic and clinical reasoning tasks, including emergency department triage. The text-only AI, released in September 2024, excelled in managing clinical vignettes and real-world assessments.

Apr 30, 3:47 PM(81 days ago)·1m read88 sources

ER+85

OpenAI o1 Model Outperforms Doctors on Clinical Reasoning Tasks, Study Finds

Audio version

Tap play to generate a narrated version.

A study published in Science revealed that OpenAI's o1 reasoning model outperformed human doctors in several clinical reasoning tasks. The AI exceeded the performance of both GPT-4 and physicians in handling clinical vignettes and conducting initial triage in a real-world emergency department setting.

The o1 model, a text-only large language model released by OpenAI in September 2024, was tested on diagnostic tasks, identifying likely diagnoses, and determining next steps in patient management. In emergency room scenarios using real data, the AI demonstrated superior accuracy compared to human doctors.

They also noted that doctors cannot be removed from the diagnostic process based on these findings. The study raises questions about the future evaluation and implementation of AI tools in clinical care. No contradictions appeared across the sources, which all described the study's outcomes consistently.

Background on the Model OpenAI released the o1 model in September 2024 as an advancement in reasoning capabilities. The research, published on Thursday, utilized real emergency department data to assess the AI's performance. This marks a step in evaluating how AI can assist in medical decision-making without replacing human expertise.

ai healthcare technology research

Transparency

How sources framed this

ER+85

These outlets didn't split into competing frames — coverage was uniform.

CorroborationLimited · 88 sources

OpenAI o1 Model Outperforms Doctors on Clinical Reasoning Tasks, Study Finds

Apr 30, 3:47 PM(81 days ago)·1m read88 sources

ER+85

OpenAI o1 Model Outperforms Doctors on Clinical Reasoning Tasks, Study Finds

Transparency

OpenAI o1 Model Outperforms Doctors on Clinical Reasoning Tasks, Study Finds

Transparency

Story details

Related Stories

U.S. to Apply IP Enforcement Tools to Chinese AI Models After Chip Restrictions; Open Source Treated as Legitimate

Chinese Open-Source AI Model Ranks Fifth on Usage Platform, Trails Top U.S. System on Intelligence Tests

Alphabet Developing Frozen v2 Chip for Gemini AI Models, Planned for 2028 Release

Related Stories

U.S. to Apply IP Enforcement Tools to Chinese AI Models After Chip Restrictions; Open Source Treated as Legitimate

Chinese Open-Source AI Model Ranks Fifth on Usage Platform, Trails Top U.S. System on Intelligence Tests

Alphabet Developing Frozen v2 Chip for Gemini AI Models, Planned for 2028 Release