The Consistency and Quality of ChatGPT Responses Compared to Clinical Guidelines for Ovarian Cancer: A Delphi Approach

Dario Piazza; Federica Martorana; Annabella Curaba; Daniela Sambataro; Maria Rosaria Valerio; Alberto Firenze; Basilio Pecorino; Paolo Scollo; Vito Chiantera; Giuseppe Scibilia; Paolo Vigneri; Vittorio Gebbia; Giuseppa Scandurra

doi:10.3390/curroncol31050212

Current Oncology (May 2024)

The Consistency and Quality of ChatGPT Responses Compared to Clinical Guidelines for Ovarian Cancer: A Delphi Approach

Dario Piazza,
Federica Martorana,
Annabella Curaba,
Daniela Sambataro,
Maria Rosaria Valerio,
Alberto Firenze,
Basilio Pecorino,
Paolo Scollo,
Vito Chiantera,
Giuseppe Scibilia,
Paolo Vigneri,
Vittorio Gebbia,
Giuseppa Scandurra

Affiliations

Dario Piazza: Medical Oncology Unit, Casa di Cura Torina, 90145 Palermo, Italy
Federica Martorana: Department of Clinical and Experimental Medicine, University of Catania, 95124 Catania, Italy
Annabella Curaba: Medical Oncology Unit, Casa di Cura Torina, 90145 Palermo, Italy
Daniela Sambataro: Medical Oncology Unit, Ospedale Umberto I, 94100 Enna, Italy
Maria Rosaria Valerio: Medical Oncology Unit, Policlinico P. Giaccone, University of Palermo, 90133 Palermo, Italy
Alberto Firenze: Occupational Health Section, Department of Health Promotion, Mother and Child Care, Internal Medicine and Medical Specialties, University of Palermo, 90133 Palermo, Italy
Basilio Pecorino: Gynecology Unit, Ospedale Cannizzaro, 95126 Catania, Italy
Paolo Scollo: Gynecology Unit, Ospedale Cannizzaro, 95126 Catania, Italy
Vito Chiantera: Gynecology, University of Palermo, 90133 Palermo, Italy
Giuseppe Scibilia: Gynecology Unit, Ospedale Paternò Arezzo, 97100 Ragusa, Italy
Paolo Vigneri: Medical Oncology, University of Catania, 95124 Catania, Italy
Vittorio Gebbia: Medical Oncology Unit, Casa di Cura Torina, 90145 Palermo, Italy
Giuseppa Scandurra: Medical Oncology Unit, Ospedale Cannizzaro, 95126 Catania, Italy

DOI: https://doi.org/10.3390/curroncol31050212
Journal volume & issue: Vol. 31, no. 5
pp. 2796 – 2804

Abstract

Read online

Introduction: In recent years, generative Artificial Intelligence models, such as ChatGPT, have increasingly been utilized in healthcare. Despite acknowledging the high potential of AI models in terms of quick access to sources and formulating responses to a clinical question, the results obtained using these models still require validation through comparison with established clinical guidelines. This study compares the responses of the AI model to eight clinical questions with the Italian Association of Medical Oncology (AIOM) guidelines for ovarian cancer. Materials and Methods: The authors used the Delphi method to evaluate responses from ChatGPT and the AIOM guidelines. An expert panel of healthcare professionals assessed responses based on clarity, consistency, comprehensiveness, usability, and quality using a five-point Likert scale. The GRADE methodology assessed the evidence quality and the recommendations’ strength. Results: A survey involving 14 physicians revealed that the AIOM guidelines consistently scored higher averages compared to the AI models, with a statistically significant difference. Post hoc tests showed that AIOM guidelines significantly differed from all AI models, with no significant difference among the AI models. Conclusions: While AI models can provide rapid responses, they must match established clinical guidelines regarding clarity, consistency, comprehensiveness, usability, and quality. These findings underscore the importance of relying on expert-developed guidelines in clinical decision-making and highlight potential areas for AI model improvement.

Published in Current Oncology

ISSN: 1198-0052 (Print); 1718-7729 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neoplasms. Tumors. Oncology. Including cancer and carcinogens
Website: https://www.mdpi.com/journal/curroncol

About the journal

Abstract

Keywords