Interpretable survival prediction for colorectal cancer using deep learning

Ellery Wulczyn; David F. Steiner; Melissa Moran; Markus Plass; Robert Reihs; Fraser Tan; Isabelle Flament-Auvigne; Trissia Brown; Peter Regitnig; Po-Hsuan Cameron Chen; Narayan Hegde; Apaar Sadhwani; Robert MacDonald; Benny Ayalew; Greg S. Corrado; Lily H. Peng; Daniel Tse; Heimo Müller; Zhaoyang Xu; Yun Liu; Martin C. Stumpe; Kurt Zatloukal; Craig H. Mermel

doi:10.1038/s41746-021-00427-2

npj Digital Medicine (Apr 2021)

Interpretable survival prediction for colorectal cancer using deep learning

Ellery Wulczyn,
David F. Steiner,
Melissa Moran,
Markus Plass,
Robert Reihs,
Fraser Tan,
Isabelle Flament-Auvigne,
Trissia Brown,
Peter Regitnig,
Po-Hsuan Cameron Chen,
Narayan Hegde,
Apaar Sadhwani,
Robert MacDonald,
Benny Ayalew,
Greg S. Corrado,
Lily H. Peng,
Daniel Tse,
Heimo Müller,
Zhaoyang Xu,
Yun Liu,
Martin C. Stumpe,
Kurt Zatloukal,
Craig H. Mermel

Affiliations

Ellery Wulczyn: Google Health
David F. Steiner: Google Health
Melissa Moran: Google Health
Markus Plass: Medical University of Graz
Robert Reihs: Medical University of Graz
Fraser Tan: Google Health
Isabelle Flament-Auvigne: Google Health via Advanced Clinical
Trissia Brown: Google Health via Advanced Clinical
Peter Regitnig: Medical University of Graz
Po-Hsuan Cameron Chen: Google Health
Narayan Hegde: Google Health
Apaar Sadhwani: Google Health
Robert MacDonald: Google Health
Benny Ayalew: Google Health
Greg S. Corrado: Google Health
Lily H. Peng: Google Health
Daniel Tse: Google Health
Heimo Müller: Medical University of Graz
Zhaoyang Xu: Google Health
Yun Liu: Google Health
Martin C. Stumpe: Google Health via Advanced Clinical
Kurt Zatloukal: Medical University of Graz
Craig H. Mermel: Google Health

DOI: https://doi.org/10.1038/s41746-021-00427-2
Journal volume & issue: Vol. 4, no. 1
pp. 1 – 13

Abstract

Read online

Abstract Deriving interpretable prognostic features from deep-learning-based prognostic histopathology models remains a challenge. In this study, we developed a deep learning system (DLS) for predicting disease-specific survival for stage II and III colorectal cancer using 3652 cases (27,300 slides). When evaluated on two validation datasets containing 1239 cases (9340 slides) and 738 cases (7140 slides), respectively, the DLS achieved a 5-year disease-specific survival AUC of 0.70 (95% CI: 0.66–0.73) and 0.69 (95% CI: 0.64–0.72), and added significant predictive value to a set of nine clinicopathologic features. To interpret the DLS, we explored the ability of different human-interpretable features to explain the variance in DLS scores. We observed that clinicopathologic features such as T-category, N-category, and grade explained a small fraction of the variance in DLS scores (R 2 = 18% in both validation sets). Next, we generated human-interpretable histologic features by clustering embeddings from a deep-learning-based image-similarity model and showed that they explained the majority of the variance (R 2 of 73–80%). Furthermore, the clustering-derived feature most strongly associated with high DLS scores was also highly prognostic in isolation. With a distinct visual appearance (poorly differentiated tumor cell clusters adjacent to adipose tissue), this feature was identified by annotators with 87.0–95.5% accuracy. Our approach can be used to explain predictions from a prognostic deep learning model and uncover potentially-novel prognostic features that can be reliably identified by people for future validation studies.

Published in npj Digital Medicine

ISSN: 2398-6352 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://www.nature.com/npjdigitalmed/

About the journal