Predicting Breast Cancer Gene Expression Signature by Applying Deep Convolutional Neural Networks From Unannotated Pathological Images

Nam Nhut Phan; Chi-Cheng Huang; Ling-Ming Tseng; Eric Y. Chuang

doi:10.3389/fonc.2021.769447

Frontiers in Oncology (Dec 2021)

Affiliations

Nam Nhut Phan: Bioinformatics Program, Taiwan International Graduate Program, Institute of Information Science, Academia Sinica, Taipei, Taiwan
Nam Nhut Phan: Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, Taipei, Taiwan
Nam Nhut Phan: Bioinformatics and Biostatistics Core, Centre of Genomic and Precision Medicine, National Taiwan University, Taipei, Taiwan
Chi-Cheng Huang: Comprehensive Breast Health Center, Taipei Veterans General Hospital, Taipei, Taiwan
Chi-Cheng Huang: Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
Ling-Ming Tseng: Comprehensive Breast Health Center, Taipei Veterans General Hospital, Taipei, Taiwan
Ling-Ming Tseng: School of Medicine, College of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
Eric Y. Chuang: Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, Taipei, Taiwan
Eric Y. Chuang: Bioinformatics and Biostatistics Core, Centre of Genomic and Precision Medicine, National Taiwan University, Taipei, Taiwan
Eric Y. Chuang: Master Program for Biomedical Engineering, China Medical University, Taichung, Taiwan

DOI: https://doi.org/10.3389/fonc.2021.769447
Journal volume & issue: Vol. 11

Abstract

We proposed a highly versatile two-step transfer learning pipeline for predicting the gene signature that defines the intrinsic breast cancer subtypes from unannotated pathological images. Deciphering breast cancer molecular subtypes with deep learning could provide a convenient and efficient method for diagnosing breast cancer patients. It could also reduce the costs associated with transcriptional profiling and the subtyping discrepancies between IHC assays and mRNA expression. Four pretrained models (VGG16, ResNet50, ResNet101, and Xception) were trained on our in-house pathological images from breast cancer patients with recurrent status in the first transfer learning step and on the TCGA-BRCA dataset in the second transfer learning step. For comparison with these models, we also trained a ResNet101 model initialized with ImageNet weights (ResNet101_imgnet). The two-step deep learning models showed promising classification results for the four breast cancer intrinsic subtypes, with accuracy ranging from 0.68 (ResNet50) to 0.78 (ResNet101) in both the validation and testing sets. Slide-wise prediction reached an even higher overall accuracy, averaging 0.913 with the ResNet101 model. The micro- and macro-average area under the curve (AUC) for these models ranged from 0.88 (ResNet50) to 0.94 (ResNet101), whereas ResNet101_imgnet achieved an AUC of 0.92. We also show that the prediction performance of the deep learning models is significantly improved relative to the common Genefu tool for breast cancer classification. Our study demonstrated the capability of deep learning models to classify breast cancer intrinsic subtypes without region-of-interest annotation, which will facilitate the clinical applicability of the proposed models.
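The abstract reports a higher accuracy for slide-wise prediction than for the underlying classification, but it does not say how image-level predictions were combined per slide. The following is a minimal sketch of one common approach, majority voting over image tiles, offered as an assumption and not as the authors' method. The function names, slide identifiers, and subtype labels are hypothetical and chosen only for illustration.

```python
from collections import Counter, defaultdict


def slide_level_predictions(tile_predictions):
    """Collapse (slide_id, predicted_label) tile pairs into one label per slide.

    Assumption: majority vote over tiles. The abstract does not specify
    the aggregation rule actually used in the study.
    """
    votes = defaultdict(Counter)
    for slide_id, label in tile_predictions:
        votes[slide_id][label] += 1
    # most_common(1) yields the label with the highest tile count;
    # on a tie, the label encountered first for that slide wins.
    return {sid: counts.most_common(1)[0][0] for sid, counts in votes.items()}


def accuracy(predicted, truth):
    """Fraction of slides whose predicted label matches the true label."""
    correct = sum(predicted[sid] == truth[sid] for sid in truth)
    return correct / len(truth)


# Toy data: two slides with three tiles each (labels are illustrative only).
tiles = [
    ("slide_1", "LumA"), ("slide_1", "LumA"), ("slide_1", "LumB"),
    ("slide_2", "Basal"), ("slide_2", "HER2"), ("slide_2", "Basal"),
]
truth = {"slide_1": "LumA", "slide_2": "Basal"}

slide_pred = slide_level_predictions(tiles)
tile_acc = sum(label == truth[sid] for sid, label in tiles) / len(tiles)
slide_acc = accuracy(slide_pred, truth)
print(f"tile accuracy:  {tile_acc:.3f}")
print(f"slide accuracy: {slide_acc:.3f}")
```

In this toy case, four of six tiles are correct while both slides are classified correctly, which shows how voting can absorb isolated tile errors and raise slide-level accuracy above tile-level accuracy.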

Published in Frontiers in Oncology

ISSN: 2234-943X (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neoplasms. Tumors. Oncology. Including cancer and carcinogens
Website: https://www.frontiersin.org/journals/oncology/
