Discover Oncology (Jan 2023)

Machine learning to predict overall short-term mortality in cutaneous melanoma

  • C. Cozzolino,
  • A. Buja,
  • M. Rugge,
  • A. Miatton,
  • M. Zorzi,
  • A. Vecchiato,
  • P. Del Fiore,
  • S. Tropea,
  • A. Brazzale,
  • G. Damiani,
  • L. dall’Olmo,
  • C. R. Rossi,
  • S. Mocellin

DOI
https://doi.org/10.1007/s12672-023-00622-5
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 14

Abstract

Read online

Abstract Background Cutaneous malignant melanoma (CMM) ranks among the ten most frequent malignancies, clinicopathological staging being of key importance to predict prognosis. Artificial intelligence (AI) has been recently applied to develop prognostically reliable staging systems for CMM. This study aims to provide a useful machine learning based tool to predict the overall CMM short-term survival. Methods CMM records as collected at the Veneto Cancer Registry (RTV) and at the Veneto regional health service were considered. A univariate Cox regression validated the strength and direction of each independent variable with overall mortality. A range of machine learning models (Logistic Regression classifier, Support-Vector Machine, Random Forest, Gradient Boosting, and k-Nearest Neighbors) and a Deep Neural Network were then trained to predict the 3-years mortality probability. Five-fold cross-validation and Grid Search were performed to test the best data preprocessing procedures, features selection, and to optimize models hyperparameters. A final evaluation was carried out on a separate test set in terms of balanced accuracy, precision, recall and F1 score. The best model was deployed as online tool. Results The univariate analysis confirmed the significant prognostic value of TNM staging. Adjunctive clinicopathological variables not included in the AJCC 8th melanoma staging system, i.e., sex, tumor site, histotype, growth phase, and age, were significantly linked to overall survival. Among the models, the Neural Network and the Random Forest models featured the best prognostic performance, achieving a balanced accuracy of 91% and 88%, respectively. According to the Gini importance score, age, T and M stages, mitotic count, and ulceration appeared to be the variables with the greatest impact on survival prediction. Conclusions Using data from patients with CMM, we developed an AI algorithm with high staging reliability, on top of which a web tool was implemented ( unipd.link/melanomaprediction ). Being essentially based on routinely recorded clinicopathological variables, it can already be implemented with minimal effort and further tested in the current clinical practice, an essential phase for validating the model’s accuracy beyond the original research context.

Keywords