Machine learning methods to predict presence of residual cancer following hysterectomy

Reetam Ganguli; Jordan Franklin; Xiaotian Yu; Alice Lin; Daithi S. Heffernan

doi:10.1038/s41598-022-06585-x

Scientific Reports (Feb 2022)

Affiliations

Reetam Ganguli: Brown University
Jordan Franklin: Department of Computer Sciences, Georgia Institute of Technology
Xiaotian Yu: Department of Mathematics, University of Virginia
Alice Lin: Warren Alpert Medical School
Daithi S. Heffernan: Brown University

DOI: https://doi.org/10.1038/s41598-022-06585-x
Journal volume & issue: Vol. 12, no. 1
pp. 1 – 10

Abstract

Surgical management of gynecologic malignancies often involves hysterectomy, one of the most common gynecologic surgeries worldwide. Despite maximal surgical and medical care, gynecologic malignancies have a high rate of recurrence following surgery. Current machine learning models rely on advanced pathology data that are often inaccessible in low-resource settings, and they are specific to single cancer types. There is a need for machine learning models that predict non-clinically evident residual disease using only clinically available health data. Here we developed and tested multiple machine learning models to assess the risk of residual disease post-hysterectomy based on clinical and operative parameters. Data from 3656 hysterectomy patients in the NSQIP dataset, spanning 14 years, were used to develop the models, with a training set of 2925 patients and a validation set of 731 patients. Our models revealed that the top postoperative predictors of residual disease were the initial presence of gross abdominal disease on the diaphragm, disease located on the bowel mesentery, disease located on the bowel serosa, and disease located within the adjacent pelvis prior to resection. There were no statistically significant differences in performance among the top three models: Extreme Gradient Boosting, Random Forest, and Logistic Regression had comparable AUC ROC (0.90) and accuracy (87–88%). Using these models, physicians can identify gynecologic cancer patients who may benefit from additional treatment after hysterectomy. For patients at high risk of disease recurrence despite adequate surgical intervention, machine learning models may lay the basis for prospective trials of prophylactic/adjuvant therapy for non-clinically evident residual disease, particularly in under-resourced settings.
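The evaluation quantities named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code. It assumes an 80% training fraction (the abstract reports only the resulting counts, 2925 and 731), and the helper names `split_sizes`, `auc_roc`, and `accuracy`, the 0.5 decision threshold, and the toy labels and scores are all hypothetical.

```python
import math


def split_sizes(n_total, train_fraction=0.8):
    """Return (n_train, n_validation) for a simple holdout split.

    The 0.8 fraction is an assumption; the abstract states only the counts.
    """
    n_train = math.ceil(n_total * train_fraction)
    return n_train, n_total - n_train


def auc_roc(labels, scores):
    """AUC ROC as the chance that a random positive case outscores a
    random negative case, with ties counted as one half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def accuracy(labels, scores, threshold=0.5):
    """Fraction of cases classified correctly at a hypothetical threshold."""
    preds = [1 if s >= threshold else 0 for s in scores]
    return sum(p == y for p, y in zip(preds, labels)) / len(labels)


# Cohort size from the abstract; reproduces the reported 2925 / 731 split.
n_train, n_val = split_sizes(3656)

# Toy data only (1 = residual disease present); not from the study.
toy_labels = [0, 0, 1, 1]
toy_scores = [0.1, 0.4, 0.35, 0.8]
toy_auc = auc_roc(toy_labels, toy_scores)
toy_acc = accuracy(toy_labels, toy_scores)
```

Under these assumptions, any of the three model families could supply the predicted scores; the sketch shows only how the reported split sizes and metrics are defined.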

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/
