Evaluation of the accuracy and readability of ChatGPT-4 and Google Gemini in providing information on retinal detachment: a multicenter expert comparative study

Piotr Strzalkowski; Alicja Strzalkowska; Jay Chhablani; Kristina Pfau; Marie-Hélène Errera; Mathias Roth; Friederike Schaub; Nikolaos E. Bechrakis; Hans Hoerauf; Constantin Reiter; Alexander K. Schuster; Gerd Geerling; Rainer Guthoff

doi:10.1186/s40942-024-00579-9

International Journal of Retina and Vitreous (Sep 2024)

Evaluation of the accuracy and readability of ChatGPT-4 and Google Gemini in providing information on retinal detachment: a multicenter expert comparative study

Piotr Strzalkowski,
Alicja Strzalkowska,
Jay Chhablani,
Kristina Pfau,
Marie-Hélène Errera,
Mathias Roth,
Friederike Schaub,
Nikolaos E. Bechrakis,
Hans Hoerauf,
Constantin Reiter,
Alexander K. Schuster,
Gerd Geerling,
Rainer Guthoff

Affiliations

Piotr Strzalkowski: Department of Ophthalmology, Medical Faculty and University Hospital Düsseldorf – Heinrich Heine University Düsseldorf
Alicja Strzalkowska: Department of Ophthalmology, Medical Faculty and University Hospital Düsseldorf – Heinrich Heine University Düsseldorf
Jay Chhablani: UPMC Eye Center, University of Pittsburgh
Kristina Pfau: Department of Ophthalmology, University Hospital of Basel
Marie-Hélène Errera: UPMC Eye Center, University of Pittsburgh
Mathias Roth: Department of Ophthalmology, Medical Faculty and University Hospital Düsseldorf – Heinrich Heine University Düsseldorf
Friederike Schaub: Department of Ophthalmology, University Medical Centre Rostock
Nikolaos E. Bechrakis: Department of Ophthalmology, University Hospital Essen
Hans Hoerauf: Department of Ophthalmology, University Medical Center Göttingen
Constantin Reiter: Department of Ophthalmology, Helios HSK Wiesbaden
Alexander K. Schuster: Department of Ophthalmology, Mainz University Medical Centre of the Johannes Gutenberg, University of Mainz
Gerd Geerling: Department of Ophthalmology, Medical Faculty and University Hospital Düsseldorf – Heinrich Heine University Düsseldorf
Rainer Guthoff: Department of Ophthalmology, Medical Faculty and University Hospital Düsseldorf – Heinrich Heine University Düsseldorf

DOI: https://doi.org/10.1186/s40942-024-00579-9
Journal volume & issue: Vol. 10, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Background Large language models (LLMs) such as ChatGPT-4 and Google Gemini show potential for patient health education, but concerns about their accuracy require careful evaluation. This study evaluates the readability and accuracy of ChatGPT-4 and Google Gemini in answering questions about retinal detachment. Methods Comparative study analyzing responses from ChatGPT-4 and Google Gemini to 13 retinal detachment questions, categorized by difficulty levels (D1, D2, D3). Masked responses were reviewed by ten vitreoretinal specialists and rated on correctness, errors, thematic accuracy, coherence, and overall quality grading. Analysis included Flesch Readability Ease Score, word and sentence counts. Results Both Artificial Intelligence tools required college-level understanding for all difficulty levels. Google Gemini was easier to understand (p = 0.03), while ChatGPT-4 provided more correct answers for the more difficult questions (p = 0.0005) with fewer serious errors. ChatGPT-4 scored highest on most challenging questions, showing superior thematic accuracy (p = 0.003). ChatGPT-4 outperformed Google Gemini in 8 of 13 questions, with higher overall quality grades in the easiest (p = 0.03) and hardest levels (p = 0.0002), showing a lower grade as question difficulty increased. Conclusions ChatGPT-4 and Google Gemini effectively address queries about retinal detachment, offering mostly accurate answers with few critical errors, though patients require higher education for comprehension. The implementation of AI tools may contribute to improving medical care by providing accurate and relevant healthcare information quickly.

Published in International Journal of Retina and Vitreous

ISSN: 2056-9920 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Ophthalmology
Website: https://journalretinavitreous.biomedcentral.com

About the journal

Abstract

Keywords