Predicting oncogene mutations of lung cancer using deep learning and histopathologic features on whole-slide images

Naofumi Tomita; Laura J. Tafe; Arief A. Suriawinata; Gregory J. Tsongalis; Mustafa Nasir-Moin; Konstantin Dragnev; Saeed Hassanpour

Translational Oncology (Oct 2022)

Predicting oncogene mutations of lung cancer using deep learning and histopathologic features on whole-slide images

Naofumi Tomita,
Laura J. Tafe,
Arief A. Suriawinata,
Gregory J. Tsongalis,
Mustafa Nasir-Moin,
Konstantin Dragnev,
Saeed Hassanpour

Affiliations

Naofumi Tomita: Department of Biomedical Data Science, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA
Laura J. Tafe: Department of Pathology and Laboratory Medicine, Dartmouth-Hitchcock Medical Center, Lebanon, NH 03756, USA
Arief A. Suriawinata: Department of Pathology and Laboratory Medicine, Dartmouth-Hitchcock Medical Center, Lebanon, NH 03756, USA
Gregory J. Tsongalis: Department of Pathology and Laboratory Medicine, Dartmouth-Hitchcock Medical Center, Lebanon, NH 03756, USA
Mustafa Nasir-Moin: Department of Computer Science, Dartmouth College, Hanover, NH 03755, USA
Konstantin Dragnev: Hematology and Oncology Section at Norris Cotton Cancer Center, Lebanon, NH 03756, USA
Saeed Hassanpour: Department of Biomedical Data Science, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA; Department of Computer Science, Dartmouth College, Hanover, NH 03755, USA; Department of Epidemiology, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA; Corresponding author at: Saeed Hassanpour, PhD, One Medical Center Drive, HB 7261, Lebanon, NH 03756, USA.

Journal volume & issue: Vol. 24
p. 101494

Abstract

Read online

Lung cancer is a leading cause of death in both men and women globally. The recent development of tumor molecular profiling has opened opportunities for targeted therapies for lung adenocarcinoma (LUAD) patients. However, the lack of access to molecular profiling or cost and turnaround time associated with it could hinder oncologists' willingness to order frequent molecular tests, limiting potential benefits from precision medicine. In this study, we developed a weakly supervised deep learning model for predicting somatic mutations of LUAD patients based on formalin-fixed paraffin-embedded (FFPE) whole-slide images (WSIs) using LUAD subtypes-related histological features and recent advances in computer vision. Our study was performed on a total of 747 hematoxylin and eosin (H&E) stained FFPE LUAD WSIs and the genetic mutation data of 232 patients who were treated at Dartmouth-Hitchcock Medical Center (DHMC). We developed our convolutional neural network-based models to analyze whole slides and predict five major genetic mutations, i.e., BRAF, EGFR, KRAS, STK11, and TP53. We additionally used 111 cases from the LUAD dataset of the CPTAC-3 study for external validation. Our model achieved an AUROC of 0.799 (95% CI: 0.686–0.904) and 0.686 (95% CI: 0.620–0.752) for predicting EGFR genetic mutations on the DHMC and CPTAC-3 test sets, respectively. Predicting TP53 genetic mutations also showed promising outcomes. Our results demonstrated that H&E stained FFPE LUAD whole slides could be utilized to predict oncogene mutations, such as EGFR, indicating that somatic mutations could present subtle morphological characteristics in histology slides, where deep learning-based feature extractors can learn such latent information.

Published in Translational Oncology

ISSN: 1944-7124 (Print); 1936-5233 (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Medicine: Internal medicine: Neoplasms. Tumors. Oncology. Including cancer and carcinogens
Website: https://www.journals.elsevier.com/translational-oncology/

About the journal