Substrate
ai

Harvard Study Finds AI Models Excel in Some Emergency Care Reasoning Tasks Compared to Physicians

A study by researchers at Harvard Medical School and Beth Israel Deaconess Medical Center found that artificial intelligence models, including OpenAI's o1-preview, surpassed human physicians in various clinical reasoning tasks. The AI excelled particularly in management reasoning and real-world emergency settings with limited information.

Euronews
1 source·May 5, 4:55 AM(1 day ago)·1m read
|
Harvard Study Finds AI Models Excel in Some Emergency Care Reasoning Tasks Compared to PhysiciansEuronews
Audio version
Tap play to generate a narrated version.
Developing·Limited corroboration so far. This page will refresh as more sources emerge.

Artificial intelligence models outperformed physicians in emergency care medical decisions, according to a study conducted by researchers at Harvard Medical School and Beth Israel Deaconess Medical Center in the United States. The study compared AI and physicians across a wide range of clinical reasoning tasks.

Euronews reported these findings, highlighting the AI's superior performance in most experiments.

Researchers evaluated o1-preview, OpenAI’s reasoning model released in 2024. They provided the model with a range of clinical cases, including published case conferences and real-world emergency department records. AI outperformed human physicians across most experiments, especially in management reasoning, clinical reasoning, documentation, and real-world emergency settings with limited information.

In one evaluation, researchers asked the LLM o1 and GPT-4o to assess patients at various points in a standard emergency department setting, from early triage to later admission decisions. At each stage, the model received only the information available at that point and was tasked with generating likely diagnoses and recommending next steps.

The biggest gap between AI and human physicians occurred in the triage stage, where patient information is more limited.

Peter Brodeur, co-first author and HMS clinical fellow in medicine at Beth Israel Deaconess, said, 'Models are increasingly capable. The study reflects only model performance and primarily focuses on the preview version of the o1 model. The preview version of the o1 model has since been supplanted by newer models such as OpenAI’s o3 model.

Key Facts

AI outperforms physicians
Artificial intelligence models outperformed physicians in emergency care medical decisions across most experiments, especially in management reasoning and real-
Study evaluation
Researchers evaluated OpenAI’s o1-preview model using clinical cases from published conferences and real-world emergency department records.
Triage performance gap
The largest performance gap between AI and physicians was in the triage stage with limited patient information.
Limitations noted
The study focuses on the o1-preview model, which has been replaced by newer versions like o3, and calls for further research on model variations and human-AI co
Potential benefits and risks
Authors suggest AI could mitigate diagnostic errors but warn of risks like unnecessary testing that could harm patients.

Story Timeline

5 events
  1. 2026-05-05

    Current date; study findings reported as recent by Euronews.

    1 sourceEuronews
  2. Post-2024

    Preview version of o1 model supplanted by newer models such as OpenAI’s o3 model.

    1 sourceEuronews
  3. 2024

    OpenAI released o1-preview reasoning model, which was evaluated in the study.

    1 sourceEuronews
  4. Pre-2026

    Researchers conducted study comparing AI models and physicians in clinical tasks.

    1 sourceEuronews
  5. Pre-2024

    Prior models and benchmarks used for comparison, with AI eclipsing them.

    1 sourceEuronews

Potential Impact

  1. 01

    Further studies on newer AI models like o3 to assess sustained or improved performance.

  2. 02

    Shift in evaluation methods for AI beyond multiple-choice tests due to models reaching performance ceilings.

  3. 03

    Increased adoption of AI in clinical decision support to reduce diagnostic errors and improve access in emergency settings.

  4. 04

    Exploration of human-LLM collaboration to enhance clinical reasoning in medicine.

  5. 05

    Potential risks of AI suggesting unnecessary tests, leading to patient harm if not balanced with human oversight.

Transparency Panel

Sources cross-referenced1
Confidence score75%
Synthesized bySubstrate AI
Word count233 words
PublishedMay 5, 2026, 4:55 AM
Bias signals removed4 across 4 outlets
Signal Breakdown
Loaded 1Speculative 1positive framing 1emphasizing disparity 1

Related Stories

Brockman Testifies on Heated 2017 Dispute with Musk Over OpenAI's For-Profit Shift in Federal Trialnaturalnews.com
ai1 hr agoUpdated

Brockman Testifies on Heated 2017 Dispute with Musk Over OpenAI's For-Profit Shift in Federal Trial

OpenAI President Greg Brockman detailed a heated 2017 confrontation with Elon Musk during testimony in the federal trial Musk v. Altman. He described Musk storming around a table and grabbing a painting after rejecting shared control proposals. The lawsuit seeks $150 billion in d…

The New York Times
Wired
New York Post
BBC News
Business Insider
+3
9 sources
Publishing Houses, Scott Turow Sue Meta Over AI Training Data Copyrightthenation.com
ai5 hrs agoFraming55Framing risk55/100Rewrite inherits negative framing of Meta's actions through loaded verbs and phrases, with lede misdirection centering on lawsuit filing over core infringement allegations.Click to jump to full framing analysis

Publishing Houses, Scott Turow Sue Meta Over AI Training Data Copyright

Five major publishing houses and author Scott Turow filed a class action lawsuit against Meta and CEO Mark Zuckerberg, alleging the company illegally used millions of copyrighted books and journal articles to train its Llama AI model. The suit, filed in federal court in Manhattan…

fortune.com
The Washington Post
Financial Times
NPR
4 sources
Italian Prime Minister Meloni Warns of AI-Generated Deepfakes and Shares Altered ImagePrime Minister's Office / Wikimedia (GODL-India)
ai1 hr agoDeveloping

Italian Prime Minister Meloni Warns of AI-Generated Deepfakes and Shares Altered Image

Italian Prime Minister Giorgia Meloni highlighted risks from AI-generated fake images, noting one depicting her in underwear and urging verification of online content. She filed a libel suit two years ago over similar deepfake images. Meanwhile, U.S. Secretary of State Marco Rubi…

The Independent
1 source