Harvard Medical School’s latest study: AI’s diagnostic decision-making in the emergency room is better than that of human doctors

ChainNewsAbmedia

2026-05-04 01:05:30

Harvard Medical School recently published a study in the journal Science on the performance of large language models in medical diagnosis. Using rigorous double-blind testing and clinical reasoning evaluation, the study compared how AI systems and human physicians interpret medical records. The data show that the latest AI models have an edge in handling complex clinical information, especially in high-pressure, information-heavy emergency department settings. The researchers nonetheless emphasize that the findings do not mean AI systems are ready to practice medicine autonomously, or that doctors can be removed from the diagnostic process.

AI outperforms at early decision points in the ER

The research team had the large language model evaluate patients in a standard emergency setting at several stages, from early triage to later admission decisions. At each stage, the model was given only the information available at that time, drawn directly from actual electronic medical records, and was asked to produce possible diagnoses and recommend next steps in treatment. At early decision points in real-world emergency cases, the model’s diagnostic accuracy was on par with, or even better than, that of the attending physicians, an outcome that surprised even the researchers.

The study stresses: AI still can’t practice medicine on its own; doctors’ role remains important

However, the researchers emphasized that their findings do not mean that AI systems are ready for autonomous medical practice, nor do they suggest that doctors can be removed from the diagnostic process.

The report also noted that the rapid development of AI is highly significant for the science and practice of clinical medicine. Although applying artificial intelligence to clinical decision support is sometimes viewed as high-risk, broader use of these tools may help reduce the human and economic costs of diagnostic errors, delays, and difficulty accessing care.

This article, “Harvard Medical School’s latest study: AI diagnosis decisions in the ER are better than human doctors,” first appeared on Chain News ABMedia.
