Mathematical Biosciences and Engineering (Jan 2020)
Lung adenocarcinoma pathology stages related gene identification
Abstract
ObjectiveLung cancer is a deadly disease with the highest 5-year survival rate. Lung adenocarcinoma is the main subtype of non-small cell lung cancer (NSCLC). Correct staging is critical as the basis of treatment. So, the identification of genes associated with pathologic stages of lung adenocarcinoma is helpful in understanding the pathological mechanism and designing targeted therapeutic drugs. MethodsRandom forest was suitable for high-dimensional data to identify variables associated with the outcome. The variable importance-based selection method was used to rank the candidate genes associated with pathologic stages of lung adenocarcinoma. Univariate regression was used to analyze the relationship between gene expression and prognosis. The protein-protein interaction network was used to show the interactions among the identified genes. The identified genes functional enrichment analysis was performed by GSEA software. ResultsTwelve genes significantly associated with pathologic stages of lung adenocarcinoma were identified by random forest analysis. Eight of these genes were found to play roles in survival of patients with lung adenocarcinoma. Among the 12 genes, 4 genes such as CENPH, SRSF5, PITX2 and NSG1 interacted with each other. And these genes were mainly enriched in p53 signaling pathway, cell cycle signaling pathway, JAK STAT signaling pathway and DNA replication signaling pathway. ConclusionThe identified genes may drive the changes among pathologic stages of lung adenocarcinoma and will be helpful in understanding the molecular changes underlying the pathologic stages.
Keywords