JSES International (Jan 2024)

Evaluation of information from artificial intelligence on rotator cuff repair surgery

  • Eric Warren, Jr., BS,
  • Eoghan T. Hurley, MB, MCh, PhD,
  • Caroline N. Park, MD,
  • Bryan S. Crook, MD,
  • Samuel Lorentz, MD,
  • Jay M. Levin, MD, MBA,
  • Oke Anakwenze, MD, MBA,
  • Peter B. MacDonald, MD, FRCSC,
  • Christopher S. Klifto, MD

Journal volume & issue
Vol. 8, no. 1
pp. 53 – 57

Abstract

Read online

Purpose: The purpose of this study was to analyze the quality and readability of information regarding rotator cuff repair surgery available using an online AI software. Methods: An open AI model (ChatGPT) was used to answer 24 commonly asked questions from patients on rotator cuff repair. Questions were stratified into one of three categories based on the Rothwell classification system: fact, policy, or value. The answers for each category were evaluated for reliability, quality and readability using The Journal of the American Medical Association Benchmark criteria, DISCERN score, Flesch-Kincaid Reading Ease Score and Grade Level. Results: The Journal of the American Medical Association Benchmark criteria score for all three categories was 0, which is the lowest score indicating no reliable resources cited. The DISCERN score was 51 for fact, 53 for policy, and 55 for value questions, all of which are considered good scores. Across question categories, the reliability portion of the DISCERN score was low, due to a lack of resources. The Flesch-Kincaid Reading Ease Score (and Flesch-Kincaid Grade Level) was 48.3 (10.3) for the fact class, 42.0 (10.9) for the policy class, and 38.4 (11.6) for the value class. Conclusion: The quality of information provided by the open AI chat system was generally high across all question types but had significant shortcomings in reliability due to the absence of source material citations. The DISCERN scores of the AI generated responses matched or exceeded previously published results of studies evaluating the quality of online information about rotator cuff repairs. The responses were U.S. 10th grade or higher reading level which is above the AMA and NIH recommendation of 6th grade reading level for patient materials. The AI software commonly referred the user to seek advice from orthopedic surgeons to improve their chances of a successful outcome.

Keywords