BMC Medical Genomics (Feb 2024)
Identifying functional subtypes of IgA nephropathy based on three machine learning algorithms and WGCNA
Abstract
Abstract Background IgA nephropathy (IgAN) is one of the most common primary glomerulonephritis, which is a significant cause of renal failure. At present, the classification of IgAN is often limited to pathology, and its molecular mechanism has not been established. Therefore we aim to identify subtypes of IgAN at the molecular level and explore the heterogeneity of subtypes in terms of immune cell infiltration, functional level. Methods Two microarray datasets (GSE116626 and GSE115857) were downloaded from GEO. Differential expression genes (DEGs) for IgAN were screened with limma. Three unsupervised clustering algorithms (hclust, PAM, and ConsensusClusterPlus) were combined to develop a single-sample subtype random forest classifier (SSRC). Functional subtypes of IgAN were defined based on functional analysis and current IgAN findings. Then the correlation between IgAN subtypes and clinical features such as eGFR and proteinuria was evaluated by using Pearson method. Subsequently, subtype heterogeneity was verified by subtype-specific modules identification based on weighted gene co-expression network analysis(WGCNA) and immune cell infiltration analysis based on CIBERSORT algorithm. Results We identified 102 DEGs as marker genes for IgAN and three functional subtypes namely: viral-hormonal, bacterial-immune and mixed type. We screened seventeen genes specific to viral hormonal type (ATF3, JUN and FOS etc.), and seven genes specific to bacterial immune type (LIF, C19orf51 and SLPI etc.). The subtype-specific genes showed significantly high correlation with proteinuria and eGFR. The WGCNA modules were in keeping with functions of the IgAN subtypes where the MEcyan module was specific to the viral-hormonal type and the MElightgreen module was specific to the bacterial-immune type. The results of immune cell infiltration revealed subtype-specific cell heterogeneity which included significant differences in T follicular helper cells, resting NK cells between viral-hormone type and control group; significant differences in eosinophils, monocytes, macrophages, mast cells and other cells between bacterial-immune type and control. Conclusion In this study, we identified three functional subtypes of IgAN for the first time and specific expressed genes for each subtype. Then we constructed a subtype classifier and classify IgAN patients into specific subtypes, which may be benefit for the precise treatment of IgAN patients in future.
Keywords