World Journal of Surgical Oncology (Jul 2024)
Population-Specific gene expression profiles in prostate cancer: insights from Weighted Gene Co-expression Network Analysis (WGCNA)
Abstract
Abstract This study investigates the genetic factors contributing to the disparity in prostate cancer incidence and progression among African American men (AAM) compared to European American men (EAM). The research focuses on employing Weighted Gene Co-expression Network Analysis (WGCNA) on public microarray data obtained from prostate cancer patients. The study employed WGCNA to identify clusters of genes with correlated expression patterns, which were then analyzed for their connection to population backgrounds. Additionally, pathway enrichment analysis was conducted to understand the significance of the identified gene modules in prostate cancer pathways. The Least Absolute Shrinkage and Selection Operator (LASSO) and Correlation-based Feature Selection (CFS) methods were utilized for selection of biomarker genes. The results revealed 353 differentially expressed genes (DEGs) between AAM and EAM. Six significant gene expression modules were identified through WGCNA, showing varying degrees of correlation with prostate cancer. LASSO and CFS methods pinpointed critical genes, as well as six common genes between both approaches, which are indicative of their vital role in the disease. The XGBoost classifier validated these findings, achieving satisfactory prediction accuracy. Genes such as APRT, CCL2, BEX2, MGC26963, and PLAU were identified as key genes significantly associated with cancer progression. In conclusion, the research underlines the importance of incorporating AAM and EAM population diversity in genomic studies, particularly in cancer research. In addition, the study highlights the effectiveness of integrating machine learning techniques with gene expression analysis as a robust methodology for identifying critical genes in cancer research.
Keywords