IEEE Access (Jan 2022)

A Comparative Study on Classification Methods for Renal Cell and Lung Cancers Using RNA-Seq Data

  • Bulent Haznedar,
  • Nihat Y. Simsek

DOI
https://doi.org/10.1109/ACCESS.2022.3211505
Journal volume & issue
Vol. 10
pp. 105412 – 105420

Abstract

Read online

Nowadays, The gene expression analysis gains a significant research interest and plays an important role for the classification and diagnosis of cancer types. In such research studies, the main difficulty is the processing time consumed due to numerous numbers of genes to be classified in human cell. RNA-Seq is a novel technology which enables researchers to obtain reliable knowledge in the analysis of numerous number of genes, so that can be effectively used for cancer classification. In this paper, commonly-used deep learning model based on deep neural network architecture has been proposed and utilized to analyze lung and renal cell cancer RNA-Seq datasets taken from The Cancer Genome Atlas (TCGA). The proposed method is compared with commonly-used other classical machine learning algorithms including decision trees (DT), random forests (RF), support vector machines (SVM) and artificial neural network (ANN) in terms of performance and accuracy for the same datasets. This study also presents the effects of different optimizers to the performance of deep learning algorithms. As a result, the proposed deep learning model have yielded the highest accuracy of 96.15% on renal cell and 95.54% on lung cancer data. It is found that the proposed deep learning model is very successful in classification of RNA-Seq datasets with large number of features compared. When results are compared with a previous study in literature which also analyses the same datasets, the proposed deep learning model outperforms the all other methods in various metrics.

Keywords