Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models.

Tiffany H Kung; Morgan Cheatham; Arielle Medenilla; Czarina Sillos; Lorie De Leon; Camille Elepaño; Maria Madriaga; Rimel Aggabao; Giezel Diaz-Candido; James Maningo; Victor Tseng

doi:10.1371/journal.pdig.0000198

PLOS Digital Health (Feb 2023)

Journal volume & issue: Vol. 2, no. 2, p. e0000198

Abstract

We evaluated the performance of a large language model called ChatGPT on the United States Medical Licensing Exam (USMLE), which consists of three exams: Step 1, Step 2CK, and Step 3. ChatGPT performed at or near the passing threshold for all three exams without any specialized training or reinforcement. Additionally, ChatGPT demonstrated a high level of concordance and insight in its explanations. These results suggest that large language models may have the potential to assist with medical education and, potentially, clinical decision-making.

Published in PLOS Digital Health

ISSN: 2767-3170 (Online)
Publisher: Public Library of Science (PLoS)
Country of publisher: United States
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://journals.plos.org/digitalhealth/
