Accuracy of generative artificial intelligence models in differential diagnoses of familial Mediterranean fever and deficiency of Interleukin-1 receptor antagonist

Joshua Pillai; Kathryn Pillai

Journal of Translational Autoimmunity (Dec 2023)

Accuracy of generative artificial intelligence models in differential diagnoses of familial Mediterranean fever and deficiency of Interleukin-1 receptor antagonist

Joshua Pillai,
Kathryn Pillai

Affiliations

Joshua Pillai: Department of Biological Sciences, Irvine Unified School District. 5050 Barranca Parkway, Irvine, 92604, CA, USA; Corresponding author.
Kathryn Pillai: Department of Medical Education, California University of Science and Medicine, School of Medicine. 1501 Violet St, Colton, CA, USA

Journal volume & issue: Vol. 7
p. 100213

Abstract

Read online

With the increasing development of artificial intelligence, large language models (LLMs) have been utilized to solve problems in natural language processing tasks. More recently, LLMs have shown unique potential in numerous applications within medicine but have been particularly investigated for their ability in clinical reasoning. Although the diagnostic accuracy of LLMs in forming differential diagnoses has been reviewed in general internal medicine applications, much is unknown in autoinflammatory disorders. From the nature of autoinflammatory diseases, forming a differential diagnosis is challenging due to the overlapping symptoms between disorders and even more difficult without genetic screening. In this work, the diagnostic accuracy of the Generative Pre-Trained Transformer Model-4 (GPT-4), GPT-3.5, and Large Language Model Meta AI (LLaMa) were evaluated in clinical vignettes of Deficiency of Interleukin-1 Receptor Antagonist (DIRA) and Familial Mediterranean Fever (FMF). We then compared these models to a control group including one internal medicine physician. It was found that GPT-4 did not significantly differ in correctly identifying DIRA and FMF patients compared to the internist. However, the physician maintained a significantly higher accuracy than GPT-3.5 and LLaMa 2 for either disease. Overall, we explore and discuss the unique potential of LLMs in diagnostics for autoimmune diseases.

Published in Journal of Translational Autoimmunity

ISSN: 2589-9090 (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Medicine: Internal medicine: Specialties of internal medicine: Immunologic diseases. Allergy
Website: https://www.journals.elsevier.com/journal-of-translational-autoimmunity

About the journal

Abstract

Keywords