BMC Pulmonary Medicine (Sep 2024)
Construction of an artificial neural network diagnostic model and investigation of immune cell infiltration characteristics for idiopathic pulmonary fibrosis
Abstract
Background Idiopathic pulmonary fibrosis (IPF) is a severe lung condition, and finding better ways to diagnose and treat the disease is crucial for improving patient outcomes. Our study sought to develop an artificial neural network (ANN) diagnostic model for IPF and to determine the immune cell types whose infiltration differed between the IPF and control groups.
Methods We first obtained IPF microarray datasets from the Gene Expression Omnibus (GEO) database. Differentially expressed genes (DEGs) between tissues from patients with IPF and control tissues were screened and then used to construct protein-protein interaction (PPI) networks and to perform enrichment analyses. Afterward, we identified the important feature genes associated with IPF using random forest (RF) analysis, and then constructed and validated a predictive ANN model. In addition, the proportions of immune cells were quantified from the microarray gene expression profiles using cell-type identification by estimating relative subsets of RNA transcripts (CIBERSORT) analysis.
Results A total of 11 downregulated and 36 upregulated DEGs were identified. PPI network and enrichment analyses showed that the DEGs were mainly enriched in terms related to the immune system and the extracellular matrix. Using RF analysis, the significant feature genes LRRC17, COMP, ASPN, CRTAC1, POSTN, COL3A1, PEBP4, IL13RA2, and CA4 were identified. The scores of these nine feature genes were integrated into the ANN to develop a diagnostic prediction model. The receiver operating characteristic (ROC) curves demonstrated the strong diagnostic ability of the ANN in predicting IPF in both the training and testing sets. Compared with normal tissues, IPF tissues showed reduced infiltration of natural killer cells resting, monocytes, macrophages M0, and neutrophils; conversely, the infiltration of T cells CD4 memory resting, mast cells, and macrophages M0 increased.
Conclusion LRRC17, COMP, ASPN, CRTAC1, POSTN, COL3A1, PEBP4, IL13RA2, and CA4 were identified as key feature genes for IPF. The ANN model based on these nine feature genes may be valuable for diagnosing IPF. Immune cell types whose infiltration differs between IPF and normal samples may serve as targets for immunotherapy in patients with IPF.
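The gene scoring and ROC evaluation mentioned in the abstract can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration: the median-based binarization rule, the regulation direction assigned to each gene, the toy expression values, and the use of a simple summed score as a stand-in for the ANN output. The abstract does not state how the study computed its gene scores or what the trained network looks like.

```python
from statistics import median


def gene_score(values, upregulated):
    """Binarize one gene's expression across samples against its median.

    Assumed rule (not stated in the abstract): an upregulated gene scores 1
    when its expression is above the median, and a downregulated gene
    scores 1 when its expression is below the median.
    """
    cutoff = median(values)
    if upregulated:
        return [1 if v > cutoff else 0 for v in values]
    return [1 if v < cutoff else 0 for v in values]


def roc_auc(labels, scores):
    """Area under the ROC curve, computed as the probability that a random
    positive sample scores higher than a random negative one (ties = 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = IPF, 0 = control. Values are invented.
labels = [1, 1, 1, 0, 0, 0]
expression = {
    # gene: (expression per sample, treated as upregulated in IPF?)
    "POSTN": ([8.1, 7.9, 8.4, 5.2, 5.0, 6.1], True),   # direction assumed
    "CA4": ([3.0, 3.4, 2.8, 6.5, 6.9, 3.1], False),    # direction assumed
}

# Per-gene binary scores, then a summed per-sample score.
# The sum is only a placeholder for the ANN's predicted probability.
per_gene = [gene_score(vals, up) for vals, up in expression.values()]
sample_scores = [sum(col) for col in zip(*per_gene)]

auc = roc_auc(labels, sample_scores)
print(f"AUC on toy data: {auc:.3f}")
```

In practice the per-sample score would come from the trained network, and the AUC would be reported separately for the training and testing sets, as the abstract describes.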
Keywords