mSystems (Oct 2024)
HCMV detection in Asian gastric cancer RNA-seq data sets and clinical validation in Indian GC patients reveals the HCMV-GC specific gene signatures
Abstract
ABSTRACT Gastric cancer (GC) prevalence is very high in the Asian population. Oncogenic viruses play a crucial role in the progression of different types of cancers. Through reanalysis of clinical RNA-seq data sets derived from Asian GC patients, this study identified the presence of human cytomegalovirus (HCMV) in Asian GC tumors, next to the well-studied association of EBV. Clinical recruitment of the Indian GC cohort and screening for HCMV presence identified a 14.28% occurrence, similar to that observed in the bioinformatics analysis. A combinatorial approach of rank-based meta-analysis and ranking of groups based on an expectation-maximization algorithm identified that the upregulated LINC02864 and MAGEA10 correlated with poor survival of GC patients and downregulated tumor suppressor genes enriching for gastric acid secretion pathway to be associated with HCMV-positive GC patients, revealing the progressive role of HCMV infection in GC. Genes that discriminate between different stages of GC were identified through feature selection implemented in a machine-learning approach. LTF and KLK10 expressions were found to be specifically dysregulated by HCMV and can also indicate the GC stages. The results of this study will guide future studies to identify the functional role of these genes in the HCMV-associated GC.IMPORTANCENearly 75% of gastric cancer (GC) cases reported globally are from the Asian population. Most existing public databases, such as TCGA, comprise only a fractional portion of data derived from Asian ancestry. This study identified EBV and human cytomegalovirus (HCMV)'s higher detection in GC patients. The presence and role of EBV associated with GC are well-known, and the observation of HCMV prompted us to validate the findings in a small cohort of 40 Indian GC patients. We observed a 14.28% occurrence of HCMV in the Indian cohort, similar to that observed from next-generation sequencing. A combinatorial approach of rank-based meta-analysis and ranking of groups based on an expectation-maximization algorithm identified that the upregulated LINC02864 and MAGEA10 correlated with poor survival of GC patients and downregulated tumor suppressor genes enriching for gastric acid secretion pathway to be associated with HCMV-positive GC patients, revealing the progressive role of HCMV infection in GC.
Keywords