Computational and Structural Biotechnology Journal (Jan 2020)

A deep learning and similarity-based hierarchical clustering approach for pathological stage prediction of papillary renal cell carcinoma

  • Sugi Lee,
  • Jaeeun Jung,
  • Ilkyu Park,
  • Kunhyang Park,
  • Dae-Soo Kim

Journal volume & issue
Vol. 18
pp. 2639 – 2646

Abstract

Read online

Papillary renal cell carcinoma (pRCC), which accounts for 10–15% of renal cell carcinomas, is the second most frequent renal cell carcinoma. pRCC patient classification is difficult because of disease heterogeneity, histologic subtypes, and variations in both disease progression and patient outcomes. Nevertheless, symptom-based patient classification is indispensable in deciding treatment options. Here we introduce a prediction method for distinguishing pRCC pathological tumour stages using deep learning and similarity-based hierarchical clustering approaches. Differentially expressed genes (DEGs) were identified from gene expression data of pRCC patients retrieved from TCGA. Thirty-three of these genes were distinguished based on expression in early or late stage pRCC using the Wilcoxon rank sum test, confidence interval, and LASSO regression. Then, a deep learning model was constructed to predict tumour progression with an accuracy of 0.942 and area under curve of 0.933. Furthermore, pathological sub-stage information with an accuracy of 0.857 was obtained via similarity-based hierarchical clustering using 18 DEGs between stages I and II, and 11 DEGs between stages III and IV, identified through Wilcoxon rank sum test and quantile approach. Additionally, we offer this classification process as an R function. This is the first report of a model distinguishing the pathological tumour stages of pRCC using deep learning and similarity-based hierarchical clustering methods. Our findings are potentially applicable for improving early detection and treatment of pRCC and establishing a clearer classification of the pathological stages in other tumours.

Keywords