Cancer Informatics (Oct 2024)
Machine Learning for Dynamic Prognostication of Patients With Hepatocellular Carcinoma Using Time-Series Data: Survival Path Versus Dynamic-DeepHit HCC Model
Abstract
Objectives: Patients with intermediate or advanced hepatocellular carcinoma (HCC) require repeated disease monitoring, prognosis assessment and treatment planning. In 2018, a novel machine learning methodology “survival path” (SP) was developed to facilitate dynamic prognosis prediction and treatment planning. One year after, a deep learning approach called Dynamic Deephit was developed. The performance of the two state-of-art models in dynamic prognostication have not been compared. Methods: We trained and tested the SP and Dynamic DeepHit models in a large cohort of 2511 HCC patients using time-series data. The time-series data were converted into data of time slices, with an interval of three months. The time-dependent c-index for OS at given prediction time ( t = 1, 6, 12, 18 months) and evaluation time (∆ t = 3, 6, 9, 12, 18, 24, 36, 48 months) were compared. Results: The comparison between SP model and Dynamic DeepHit-HCC model showed the latter had significant better performance at the time of initial admission. The time-dependent c-index of Dynamic DeepHit-HCC model gradually decreased with the extension of time (from 0.756 to 0.639 in the training set; from 0.787 to 0.661 in internal testing set; from 0.725 to 0.668 in multicenter testing set); while the time-dependent c-index of SP model displayed an increased trend (from 0.665 to 0.748 in the training set; from 0.608 to 0.743 in internal testing set; from 0.643 to 0.720 in multicenter testing set). When the prediction time comes to 6 months or later since initial treatment, the survival path model outperformed the dynamic DeepHit model at late evaluation times (∆ t > 12 months). Conclusions: This research highlighted the unique strengths of both models. The SP model had advantage in long term prediction while the Dynamic DeepHit-HCC model had advantages in prediction at near time points. Fine selection of models is needed in dealing with different scenarios.