Comparative analysis of machine learning approaches to classify tumor mutation burden in lung adenocarcinoma using histopathology images

Apaar Sadhwani; Huang-Wei Chang; Ali Behrooz; Trissia Brown; Isabelle Auvigne-Flament; Hardik Patel; Robert Findlater; Vanessa Velez; Fraser Tan; Kamilla Tekiela; Ellery Wulczyn; Eunhee S. Yi; Craig H. Mermel; Debra Hanks; Po-Hsuan Cameron Chen; Kimary Kulig; Cory Batenchuk; David F. Steiner; Peter Cimermancic

doi:10.1038/s41598-021-95747-4

Scientific Reports (Aug 2021)

Comparative analysis of machine learning approaches to classify tumor mutation burden in lung adenocarcinoma using histopathology images

Apaar Sadhwani,
Huang-Wei Chang,
Ali Behrooz,
Trissia Brown,
Isabelle Auvigne-Flament,
Hardik Patel,
Robert Findlater,
Vanessa Velez,
Fraser Tan,
Kamilla Tekiela,
Ellery Wulczyn,
Eunhee S. Yi,
Craig H. Mermel,
Debra Hanks,
Po-Hsuan Cameron Chen,
Kimary Kulig,
Cory Batenchuk,
David F. Steiner,
Peter Cimermancic

Affiliations

Apaar Sadhwani: Google Health
Huang-Wei Chang: Verily Life Sciences
Ali Behrooz: Verily Life Sciences
Trissia Brown: Google Health via Vituity
Isabelle Auvigne-Flament: Google Health via Vituity
Hardik Patel: Verily Life Sciences
Robert Findlater: Verily Life Sciences
Vanessa Velez: Verily Life Sciences
Fraser Tan: Google Health
Kamilla Tekiela: Verily Life Sciences
Ellery Wulczyn: Google Health
Eunhee S. Yi: Department of Laboratory Medicine and Pathology, Mayo Clinic
Craig H. Mermel: Google Health
Debra Hanks: Verily Life Sciences
Po-Hsuan Cameron Chen: Google Health
Kimary Kulig: Verily Life Sciences
Cory Batenchuk: Verily Life Sciences
David F. Steiner: Google Health
Peter Cimermancic: Verily Life Sciences

DOI: https://doi.org/10.1038/s41598-021-95747-4
Journal volume & issue: Vol. 11, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Both histologic subtypes and tumor mutation burden (TMB) represent important biomarkers in lung cancer, with implications for patient prognosis and treatment decisions. Typically, TMB is evaluated by comprehensive genomic profiling but this requires use of finite tissue specimens and costly, time-consuming laboratory processes. Histologic subtype classification represents an established component of lung adenocarcinoma histopathology, but can be challenging and is associated with substantial inter-pathologist variability. Here we developed a deep learning system to both classify histologic patterns in lung adenocarcinoma and predict TMB status using de-identified Hematoxylin and Eosin (H&E) stained whole slide images. We first trained a convolutional neural network to map histologic features across whole slide images of lung cancer resection specimens. On evaluation using an external data source, this model achieved patch-level area under the receiver operating characteristic curve (AUC) of 0.78–0.98 across nine histologic features. We then integrated the output of this model with clinico-demographic data to develop an interpretable model for TMB classification. The resulting end-to-end system was evaluated on 172 held out cases from TCGA, achieving an AUC of 0.71 (95% CI 0.63–0.80). The benefit of using histologic features in predicting TMB is highlighted by the significant improvement this approach offers over using the clinical features alone (AUC of 0.63 [95% CI 0.53–0.72], p = 0.002). Furthermore, we found that our histologic subtype-based approach achieved performance similar to that of a weakly supervised approach (AUC of 0.72 [95% CI 0.64–0.80]). Together these results underscore that incorporating histologic patterns in biomarker prediction for lung cancer provides informative signals, and that interpretable approaches utilizing these patterns perform comparably with less interpretable, weakly supervised approaches.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal