Oman Journal of Ophthalmology (Oct 2024)

Examining the competence of artificial intelligence programs in neuro-ophthalmological disorders and analyzing their comparative superiority

  • Eyupcan Sensoy,
  • Mehmet Citirik

DOI
https://doi.org/10.4103/ojo.ojo_19_24
Journal volume & issue
Vol. 17, no. 3
pp. 348 – 351

Abstract

Read online

BACKGROUND This study aims to evaluate the knowledge levels of chat generative pretrained transformer (ChatGPT), Bing, and Bard programs, which are three different artificial intelligence chatbots offered to the market free of charge by various manufacturers, regarding neuro-ophthalmological diseases, to examine their usability, and to investigate the existence of their superiority to each other. MATERIALS AND METHODS Forty questions related to neuro-ophthalmological diseases were obtained from the study questions’ section of the American Academy and Ophthalmology 2022–2023 Basic and Clinical Science Course Neuro-ophthalmology Book. The questions were posed to the ChatGPT, Bing, and Bard artificial intelligence chatbots. The answers were evaluated as correct or incorrect. The statistical relationship between the correct and incorrect answer rates offered by the artificial intelligence programs was tested. RESULTS The correct answer rates were given by the artificial intelligence programs to the questions asked: ChatGPT – 52.5%; Bing – 55%; and Bard – 65%. There was no statistically significant difference between the correct answer rates of the three artificial intelligence programs (P = 0.489, Pearson’s Chi-square test). CONCLUSION Although information about neuro-ophthalmological diseases can be accessed quickly and accurately using up-to-date artificial intelligence programs, the answers given may not always be correct. Care should always be taken when evaluating the answers to the questions.

Keywords