The Sunday Guardian

CAN CHATGPT PASS U.S. MEDICAL LICENSING EXAM?


WASHINGTON: ChatGPT can score at or around the roughly 60 per cent passing threshold for the United States Medical Licensing Exam (USMLE), with responses that are coherent and internally consistent and that frequently contain insights, according to a study by Tiffany Kung, Victor Tseng, and colleagues at AnsibleHealth that was published on February 9, 2023, in the open-access journal PLOS Digital Health.

ChatGPT is a large language model (LLM), a new kind of artificial intelligence (AI) system designed to produce human-like writing by predicting upcoming word sequences. Unlike most chatbots, ChatGPT cannot search the internet. Instead, it generates text from word relationships predicted by its internal processes.

Kung and colleagues tested ChatGPT's performance on the USMLE, a highly standardized and regulated series of three exams (Steps 1, 2CK, and 3) required for medical licensure in the United States. Taken by medical students and physicians-in-training, the USMLE assesses knowledge spanning most medical disciplines, from biochemistry to diagnostic reasoning to bioethics.

After screening to remove image-based questions, the authors tested the software on 350 of the 376 public questions available from the June 2022 USMLE release. After indeterminate responses were removed, ChatGPT scored between 52.4 per cent and 75.0 per cent across the three USMLE exams. The passing threshold each year is approximately 60 per cent. ChatGPT also demonstrated 94.6 per cent concordance across all its responses and produced at least one significant insight (something that was new, non-obvious, and clinically valid) for 88.9 per cent of its responses. Notably, ChatGPT exceeded the performance of PubMedGPT, a counterpart model trained exclusively on biomedical domain literature, which scored 50.8 per cent on an older dataset of USMLE-style questions.
