Deep Learning-Based Prediction Model for Breast Cancer Recurrence Using Adjuvant Breast Cancer Cohort in Tertiary Cancer Center Registry

Ji-Yeon Kim; Yong Seok Lee; Jonghan Yu; Youngmin Park; Se Kyung Lee; Minyoung Lee; Jeong Eon Lee; Seok Won Kim; Seok Jin Nam; Yeon Hee Park; Jin Seok Ahn; Mira Kang; Young-Hyuck Im

doi:10.3389/fonc.2021.596364

Frontiers in Oncology (May 2021)

Affiliations

Ji-Yeon Kim: Division of Hematology-Oncology, Department of Medicine, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
Yong Seok Lee: Digital Health Business Team, Samsung SDS, Seoul, South Korea
Jonghan Yu: Department of Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
Youngmin Park: Digital Health Business Team, Samsung SDS, Seoul, South Korea
Se Kyung Lee: Department of Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
Minyoung Lee: Digital Health Business Team, Samsung SDS, Seoul, South Korea
Jeong Eon Lee: Department of Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
Seok Won Kim: Department of Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
Seok Jin Nam: Department of Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
Yeon Hee Park: Division of Hematology-Oncology, Department of Medicine, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
Jin Seok Ahn: Division of Hematology-Oncology, Department of Medicine, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
Mira Kang: Center for Health Promotion, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
Young-Hyuck Im: Division of Hematology-Oncology, Department of Medicine, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea

Journal volume & issue: Vol. 11

Abstract

Several prognosis prediction models have been developed for breast cancer (BC) patients who undergo curative surgery, but there remains an unmet need to determine prognosis precisely for individual patients in real time. This study is a retrospective analysis of data from the adjuvant BC registry at Samsung Medical Center, collected between January 2000 and December 2016. The initial data set contained 325 clinical data elements: baseline characteristics (demographics, clinical and pathologic information) and follow-up clinical information, including laboratory and imaging data obtained during surveillance. The Weibull Time To Event Recurrent Neural Network (WTTE-RNN) by Martinsson was implemented for machine learning. We searched for the optimal window size for the time-stamped inputs. To develop the prediction model, data from 13,117 patients were split into training (60%), validation (20%), and test (20%) sets. The median follow-up duration was 4.7 years and the median number of visits was 8.4. We identified 32 features related to BC recurrence and considered them in further analyses. Performance was assessed using Harrell's C-index and the area under the curve (AUC) at the 2-, 5-, and 7-year time points. After 200 training epochs with a batch size of 100, the C-index reached 0.92 for the training data set and 0.89 for the validation and test data sets. The AUC values were 0.90 at 2 years, 0.91 at 5 years, and 0.91 at 7 years. The final deep learning-based model outperformed three other machine learning-based models. Across pathologic characteristics, the median absolute error (MAE) and weighted mean absolute error (wMAE) were as low as 3.5%. This BC prognosis model, which estimates the probability of BC recurrence in real time, was developed with an RNN machine learning model using information from the time of BC diagnosis and the follow-up period.
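
As an illustration of the evaluation described in the abstract, the following minimal Python sketch shows how a Weibull time-to-event output can be turned into a recurrence probability and how Harrell's C-index scores the resulting ranking. It is not the authors' implementation: the function names, the toy follow-up times, and the per-patient Weibull parameters (`alphas`, `beta`) are hypothetical values chosen only for this example, and pairs with tied observed times are skipped for simplicity.

```python
import math
from itertools import combinations


def weibull_recurrence_prob(t, alpha, beta):
    """P(recurrence by time t) for a Weibull model with scale alpha and shape beta."""
    return 1.0 - math.exp(-((t / alpha) ** beta))


def harrell_c_index(times, events, risk_scores):
    """Harrell's C-index: share of comparable patient pairs ranked correctly.

    times       -- observed follow-up time per patient
    events      -- 1 if recurrence was observed, 0 if the patient was censored
    risk_scores -- higher score means earlier predicted recurrence
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Let a be the patient with the shorter observed time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        # Skip tied times (simplification) and pairs where the earlier
        # patient was censored, since their true ordering is unknown.
        if times[a] == times[b] or not events[a]:
            continue
        comparable += 1
        if risk_scores[a] > risk_scores[b]:
            concordant += 1.0
        elif risk_scores[a] == risk_scores[b]:
            concordant += 0.5  # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical toy cohort (years of follow-up, recurrence flags).
times = [2.0, 5.0, 7.0, 3.0]
events = [1, 1, 0, 0]

# Hypothetical Weibull parameters, standing in for what a model might predict.
alphas = [3.0, 25.0, 20.0, 10.0]
beta = 1.5

# Use the predicted 5-year recurrence probability as the risk score.
risks = [weibull_recurrence_prob(5.0, a, beta) for a in alphas]
c_index = harrell_c_index(times, events, risks)
print(f"C-index on toy data: {c_index:.2f}")
```

A C-index of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which is the scale on which the reported values of 0.92 and 0.89 should be read.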

Published in Frontiers in Oncology

ISSN: 2234-943X (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neoplasms. Tumors. Oncology. Including cancer and carcinogens
Website: https://www.frontiersin.org/journals/oncology/
