Evaluation of information from artificial intelligence on rotator cuff repair surgery

Eric Warren, Jr., BS; Eoghan T. Hurley, MB, MCh, PhD; Caroline N. Park, MD; Bryan S. Crook, MD; Samuel Lorentz, MD; Jay M. Levin, MD, MBA; Oke Anakwenze, MD, MBA; Peter B. MacDonald, MD, FRCSC; Christopher S. Klifto, MD

JSES International (Jan 2024)

Evaluation of information from artificial intelligence on rotator cuff repair surgery

Eric Warren, Jr., BS,
Eoghan T. Hurley, MB, MCh, PhD,
Caroline N. Park, MD,
Bryan S. Crook, MD,
Samuel Lorentz, MD,
Jay M. Levin, MD, MBA,
Oke Anakwenze, MD, MBA,
Peter B. MacDonald, MD, FRCSC,
Christopher S. Klifto, MD

Affiliations

Eric Warren, Jr., BS: Duke University School of Medicine, Duke University, Durham, NC, USA
Eoghan T. Hurley, MB, MCh, PhD: Department of Orthopaedic Surgery, Duke University, Durham, NC, USA
Caroline N. Park, MD: Department of Orthopaedic Surgery, Duke University, Durham, NC, USA
Bryan S. Crook, MD: Department of Orthopaedic Surgery, Duke University, Durham, NC, USA
Samuel Lorentz, MD: Department of Orthopaedic Surgery, Duke University, Durham, NC, USA
Jay M. Levin, MD, MBA: Department of Orthopaedic Surgery, Duke University, Durham, NC, USA
Oke Anakwenze, MD, MBA: Department of Orthopaedic Surgery, Duke University, Durham, NC, USA
Peter B. MacDonald, MD, FRCSC: Section of Orthopaedic Surgery & The Pan Am Clinic, University of Manitoba, Winnipeg, MB, Canada
Christopher S. Klifto, MD: Department of Orthopaedic Surgery, Duke University, Durham, NC, USA; Corresponding author: Christopher S. Klifto, MD, Duke University Department of Orthopaedic Surgery, 3609 SW Durham Drive, Durham, NC 27707, USA.

Journal volume & issue: Vol. 8, no. 1
pp. 53 – 57

Abstract

Read online

Purpose: The purpose of this study was to analyze the quality and readability of information regarding rotator cuff repair surgery available using an online AI software. Methods: An open AI model (ChatGPT) was used to answer 24 commonly asked questions from patients on rotator cuff repair. Questions were stratified into one of three categories based on the Rothwell classification system: fact, policy, or value. The answers for each category were evaluated for reliability, quality and readability using The Journal of the American Medical Association Benchmark criteria, DISCERN score, Flesch-Kincaid Reading Ease Score and Grade Level. Results: The Journal of the American Medical Association Benchmark criteria score for all three categories was 0, which is the lowest score indicating no reliable resources cited. The DISCERN score was 51 for fact, 53 for policy, and 55 for value questions, all of which are considered good scores. Across question categories, the reliability portion of the DISCERN score was low, due to a lack of resources. The Flesch-Kincaid Reading Ease Score (and Flesch-Kincaid Grade Level) was 48.3 (10.3) for the fact class, 42.0 (10.9) for the policy class, and 38.4 (11.6) for the value class. Conclusion: The quality of information provided by the open AI chat system was generally high across all question types but had significant shortcomings in reliability due to the absence of source material citations. The DISCERN scores of the AI generated responses matched or exceeded previously published results of studies evaluating the quality of online information about rotator cuff repairs. The responses were U.S. 10th grade or higher reading level which is above the AMA and NIH recommendation of 6th grade reading level for patient materials. The AI software commonly referred the user to seek advice from orthopedic surgeons to improve their chances of a successful outcome.

Published in JSES International

ISSN: 2666-6383 (Online)
Publisher: Elsevier
Country of publisher: United States
LCC subjects: Medicine: Surgery: Orthopedic surgery; Medicine: Internal medicine: Specialties of internal medicine: Diseases of the musculoskeletal system
Website: https://www.journals.elsevier.com/jses-international

About the journal

Abstract

Keywords