Mathematical Biosciences and Engineering (May 2021)

Robust rank aggregation and cibersort algorithm applied to the identification of key genes in head and neck squamous cell cancer

  • Tingting Chen,
  • Wei Hua ,
  • Bing Xu,
  • Hui Chen,
  • Minhao Xie,
  • Xinchen Sun,
  • Xiaolin Ge

DOI
https://doi.org/10.3934/mbe.2021228
Journal volume & issue
Vol. 18, no. 4
pp. 4491 – 4507

Abstract

Read online

Objective: Although multiple hub genes have been identified in head and neck squamous cell cancer (HNSCC) in recent years, because of the limited sample size and inconsistent bioinformatics analysis methods, the results are not reliable. Therefore, it is urgent to use reliable algorithms to find new prognostic markers of HNSCC. Method: The Robust Rank Aggregation (RRA) method was used to integrate 8 microarray datasets of HNSCC downloaded from the Gene Expression Omnibus (GEO) database to screen differentially expressed genes (DEGs). Later, Gene Ontology (GO) functional annotation together with Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis was carried out to discover functions of those discovered DEGs. According to the KEGG results, those discovered DEGs showed tight association with the occurrence and development of HNSCC. Then cibersort algorithm was used to analyze the infiltration of immune cells of HNSCC and we found that the main infiltrated immune cells were B cells, dendritic cells and macrophages. A protein-protein interaction (PPI) network was established; moreover, key modules were also constructed to select 5 hub genes from the whole network using cytoHubba. 3 hub genes showed significant relationship with prognosis for TCGA-derived HNSCC patients. Result: The potent DEGs along with hub genes were selected by the combined bioinformatic approach. AURKA, BIRC5 and UBE2C genes may be the potential prognostic biomarker and therapeutic targets of HNSCC. Conclusions: The Robust Rank Aggregation method and cibersort algorithm method can accurately predict the potential prognostic biomarker and therapeutic targets of HNSCC through multiple GEO datasets.

Keywords