Harvard Study Finds AI Models Excel in Some Emergency Care Reasoning Tasks Compared to Physicians
A study by researchers at Harvard Medical School and Beth Israel Deaconess Medical Center found that artificial intelligence models, including OpenAI's o1-preview, surpassed human physicians in various clinical reasoning tasks. The AI excelled particularly in management reasoning and real-world emergency settings with limited information.
EuronewsArtificial intelligence models outperformed physicians in emergency care medical decisions, according to a study conducted by researchers at Harvard Medical School and Beth Israel Deaconess Medical Center in the United States. The study compared AI and physicians across a wide range of clinical reasoning tasks.
Euronews reported these findings, highlighting the AI's superior performance in most experiments.
Researchers evaluated o1-preview, OpenAI’s reasoning model released in 2024. They provided the model with a range of clinical cases, including published case conferences and real-world emergency department records. AI outperformed human physicians across most experiments, especially in management reasoning, clinical reasoning, documentation, and real-world emergency settings with limited information.
In one evaluation, researchers asked the LLM o1 and GPT-4o to assess patients at various points in a standard emergency department setting, from early triage to later admission decisions. At each stage, the model received only the information available at that point and was tasked with generating likely diagnoses and recommending next steps.
The biggest gap between AI and human physicians occurred in the triage stage, where patient information is more limited.
Peter Brodeur, co-first author and HMS clinical fellow in medicine at Beth Israel Deaconess, said, 'Models are increasingly capable. The study reflects only model performance and primarily focuses on the preview version of the o1 model. The preview version of the o1 model has since been supplanted by newer models such as OpenAI’s o3 model.
Key Facts
Story Timeline
5 events- 2026-05-05
Current date; study findings reported as recent by Euronews.
1 sourceEuronews - Post-2024
Preview version of o1 model supplanted by newer models such as OpenAI’s o3 model.
1 sourceEuronews - 2024
OpenAI released o1-preview reasoning model, which was evaluated in the study.
1 sourceEuronews - Pre-2026
Researchers conducted study comparing AI models and physicians in clinical tasks.
1 sourceEuronews - Pre-2024
Prior models and benchmarks used for comparison, with AI eclipsing them.
1 sourceEuronews
Potential Impact
- 01
Further studies on newer AI models like o3 to assess sustained or improved performance.
- 02
Shift in evaluation methods for AI beyond multiple-choice tests due to models reaching performance ceilings.
- 03
Increased adoption of AI in clinical decision support to reduce diagnostic errors and improve access in emergency settings.
- 04
Exploration of human-LLM collaboration to enhance clinical reasoning in medicine.
- 05
Potential risks of AI suggesting unnecessary tests, leading to patient harm if not balanced with human oversight.
Transparency Panel
Related Stories
naturalnews.comBrockman Testifies on Heated 2017 Dispute with Musk Over OpenAI's For-Profit Shift in Federal Trial
OpenAI President Greg Brockman detailed a heated 2017 confrontation with Elon Musk during testimony in the federal trial Musk v. Altman. He described Musk storming around a table and grabbing a painting after rejecting shared control proposals. The lawsuit seeks $150 billion in d…
thenation.comPublishing Houses, Scott Turow Sue Meta Over AI Training Data Copyright
Five major publishing houses and author Scott Turow filed a class action lawsuit against Meta and CEO Mark Zuckerberg, alleging the company illegally used millions of copyrighted books and journal articles to train its Llama AI model. The suit, filed in federal court in Manhattan…
Italian Prime Minister Meloni Warns of AI-Generated Deepfakes and Shares Altered Image
Italian Prime Minister Giorgia Meloni highlighted risks from AI-generated fake images, noting one depicting her in underwear and urging verification of online content. She filed a libel suit two years ago over similar deepfake images. Meanwhile, U.S. Secretary of State Marco Rubi…