Google Bard and ChatGPT in Orthopedics: Which Is the Better Doctor in Sports Medicine and Pediatric Orthopedics? The Role of AI in Patient Education

Riccardo Giorgino; Mario Alessandri-Bonetti; Matteo Del Re; Fabio Verdoni; Giuseppe M. Peretti; Laura Mangiavini

doi:10.3390/diagnostics14121253

Diagnostics (Jun 2024)

Google Bard and ChatGPT in Orthopedics: Which Is the Better Doctor in Sports Medicine and Pediatric Orthopedics? The Role of AI in Patient Education

Riccardo Giorgino,
Mario Alessandri-Bonetti,
Matteo Del Re,
Fabio Verdoni,
Giuseppe M. Peretti,
Laura Mangiavini

Affiliations

Riccardo Giorgino: Residency Program in Orthopaedics and Traumatology, University of Milan, 20122 Milan, Italy
Mario Alessandri-Bonetti: Department of Plastic Surgery, University of Pittsburgh Medical Center, 1350 Locust Street, Pittsburgh, PA 15213, USA
Matteo Del Re: IRCCS Ospedale Galeazzi Sant’ambrogio, 20157 Milan, Italy
Fabio Verdoni: IRCCS Ospedale Galeazzi Sant’ambrogio, 20157 Milan, Italy
Giuseppe M. Peretti: IRCCS Ospedale Galeazzi Sant’ambrogio, 20157 Milan, Italy
Laura Mangiavini: IRCCS Ospedale Galeazzi Sant’ambrogio, 20157 Milan, Italy

DOI: https://doi.org/10.3390/diagnostics14121253
Journal volume & issue: Vol. 14, no. 12
p. 1253

Abstract

Read online

Background: This study evaluates the potential of ChatGPT and Google Bard as educational tools for patients in orthopedics, focusing on sports medicine and pediatric orthopedics. The aim is to compare the quality of responses provided by these natural language processing (NLP) models, addressing concerns about the potential dissemination of incorrect medical information. Methods: Ten ACL- and flat foot-related questions from a Google search were presented to ChatGPT-3.5 and Google Bard. Expert orthopedic surgeons rated the responses using the Global Quality Score (GQS). The study minimized bias by clearing chat history before each question, maintaining respondent anonymity and employing statistical analysis to compare response quality. Results: ChatGPT-3.5 and Google Bard yielded good-quality responses, with average scores of 4.1 ± 0.7 and 4 ± 0.78, respectively, for sports medicine. For pediatric orthopedics, Google Bard scored 3.5 ± 1, while the average score for responses generated by ChatGPT was 3.8 ± 0.83. In both cases, no statistically significant difference was found between the platforms (p = 0.6787, p = 0.3092). Despite ChatGPT’s responses being considered more readable, both platforms showed promise for AI-driven patient education, with no reported misinformation. Conclusions: ChatGPT and Google Bard demonstrate significant potential as supplementary patient education resources in orthopedics. However, improvements are needed for increased reliability. The study underscores the evolving role of AI in orthopedics and calls for continued research to ensure a conscientious integration of AI in healthcare education.

Published in Diagnostics

ISSN: 2075-4418 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Medicine: Medicine (General)
Website: http://www.mdpi.com/journal/diagnostics

About the journal

Abstract

Keywords