Diagnostics (Jun 2024)
Google Bard and ChatGPT in Orthopedics: Which Is the Better Doctor in Sports Medicine and Pediatric Orthopedics? The Role of AI in Patient Education
Abstract
Background: This study evaluates the potential of ChatGPT and Google Bard as educational tools for patients in orthopedics, focusing on sports medicine and pediatric orthopedics. The aim is to compare the quality of the responses provided by these natural language processing (NLP) models, addressing concerns about the potential dissemination of incorrect medical information. Methods: Ten ACL- and flat-foot-related questions drawn from a Google search were presented to ChatGPT-3.5 and Google Bard. Expert orthopedic surgeons rated the responses using the Global Quality Score (GQS). Bias was minimized by clearing the chat history before each question and maintaining respondent anonymity, and statistical analysis was used to compare response quality. Results: ChatGPT-3.5 and Google Bard both yielded good-quality responses for sports medicine, with average scores of 4.1 ± 0.7 and 4 ± 0.78, respectively. For pediatric orthopedics, ChatGPT scored 3.8 ± 0.83 on average and Google Bard 3.5 ± 1. In both cases, no statistically significant difference was found between the platforms (p = 0.6787 and p = 0.3092, respectively). Although ChatGPT’s responses were considered more readable, both platforms showed promise for AI-driven patient education, and no misinformation was reported. Conclusions: ChatGPT and Google Bard demonstrate considerable potential as supplementary patient education resources in orthopedics. However, improvements are needed to increase their reliability. The study underscores the evolving role of AI in orthopedics and calls for continued research to ensure a conscientious integration of AI in healthcare education.
Keywords