
Could AI perform a patient's physical exam?
A Mass General Brigham study suggests that LLMs could be used to aid clinicians during physical exams.
During any patient visit, a physical examination is critical in evaluating a patient’s health, determining what
The study, published in the
“Medical professionals early in their career may face challenges in performing the appropriate patient-tailored physical exam because of their limited experience or other context-dependent factors, such as lower resourced settings,” Marc D. Succi, MD, senior author of the study, said in a
Researchers prompted GPT-4 to recommend physical exam instructions based on the 19 chief complaints detailed in the
The attending physicians determined that GPT-4 provided solid instructions, scoring at least 80% of the possible points. GPT-4 provided the highest-quality instructions for examining a patient with “leg pain upon exertion,” and the lowest-quality instructions for “lower abdominal pain.”
Despite this, researchers found that the program occasionally omitted key instructions or was overly vague in its explanations, indicating the need for human oversight and evaluation. Researchers conclude that GPT-4 would serve as a solid tool/aid to fill knowledge gaps and assist physicians in their physical examinations of patients.
In their qualitative analysis of the program’s responses, reviewers were impressed with the details of special tests, but critical of the program’s lack of specificity, inclusion of redundant information, omission of informative exams, inclusion of vague language and irrelevant information, general inconsistencies and lack of calling for all vital signs.
“GPT-4 performed well in many respects, yet its occasional vagueness or omissions in critical areas, like diagnostic specificity, remind us of the necessity of physician judgement to ensure comprehensive patient care,” said Arya Rao, a student researcher and lead author of the study.
Going forward, the authors of the study call for future investigations that directly compare the diagnostic capabilities of unassisted physicians to those with access to LLMs. They also recommend that real-world cases be used to train LLMs in order to address the gaps in the diagnostic capacity of GPT-4, as demonstrated in the study.
“We anticipate an
Newsletter
Stay informed and empowered with Medical Economics enewsletter, delivering expert insights, financial strategies, practice management tips and technology trends — tailored for today’s physicians.



















