Frontiers in Microbiology (Feb 2024)

Identification of important modules and biomarkers in tuberculosis based on WGCNA

  • Jing Dong,
  • Jing Dong,
  • Jing Dong,
  • Ruixue Song,
  • Ruixue Song,
  • Ruixue Song,
  • Xuetian Shang,
  • Xuetian Shang,
  • Xuetian Shang,
  • Yingchao Wang,
  • Yingchao Wang,
  • Yingchao Wang,
  • Qiuyue Liu,
  • Qiuyue Liu,
  • Zhiguo Zhang,
  • Hongyan Jia,
  • Hongyan Jia,
  • Hongyan Jia,
  • Mailing Huang,
  • Mailing Huang,
  • Chuanzhi Zhu,
  • Chuanzhi Zhu,
  • Chuanzhi Zhu,
  • Qi Sun,
  • Qi Sun,
  • Qi Sun,
  • Boping Du,
  • Boping Du,
  • Boping Du,
  • Aiying Xing,
  • Aiying Xing,
  • Aiying Xing,
  • Zihui Li,
  • Zihui Li,
  • Zihui Li,
  • Lanyue Zhang,
  • Lanyue Zhang,
  • Lanyue Zhang,
  • Liping Pan,
  • Liping Pan,
  • Liping Pan,
  • Zongde Zhang,
  • Zongde Zhang,
  • Zongde Zhang

DOI
https://doi.org/10.3389/fmicb.2024.1354190
Journal volume & issue
Vol. 15

Abstract

Read online

BackgroundTuberculosis (TB) is a significant public health concern, particularly in China. Long noncoding RNAs (lncRNAs) can provide abundant pathological information regarding etiology and could include candidate biomarkers for diagnosis of TB. However, data regarding lncRNA expression profiles and specific lncRNAs associated with TB are limited.MethodsWe performed ceRNA-microarray analysis to determine the expression profile of lncRNAs in peripheral blood mononuclear cells (PBMCs). Weighted gene co-expression network analysis (WGCNA) was then conducted to identify the critical module and genes associated with TB. Other bioinformatics analyses, including Kyoto Encyclopedia of Genes and Genomes (KEGG), Gene Ontology (GO), and co-expression networks, were conducted to explore the function of the critical module. Finally, real-time quantitative polymerase chain reaction (qPCR) was used to validate the candidate biomarkers, and receiver operating characteristic analysis was used to assess the diagnostic performance of the candidate biomarkers.ResultsBased on 8 TB patients and 9 healthy controls (HCs), a total of 1,372 differentially expressed lncRNAs were identified, including 738 upregulated lncRNAs and 634 downregulated lncRNAs. Among all lncRNAs and mRNAs in the microarray, the top 25% lncRNAs (3729) and top 25% mRNAs (2824), which exhibited higher median expression values, were incorporated into the WGCNA. The analysis generated 16 co-expression modules, among which the blue module was highly correlated with TB. GO and KEGG analyses showed that the blue module was significantly enriched in infection and immunity. Subsequently, considering module membership values (>0.85), gene significance values (>0.90) and fold-change value (>2 or < 0.5) as selection criteria, the top 10 upregulated lncRNAs and top 10 downregulated lncRNAs in the blue module were considered as potential biomarkers. The candidates were then validated in an independent validation sample set (31 TB patients and 32 HCs). The expression levels of 8 candidates differed significantly between TB patients and HCs. The lncRNAs ABHD17B (area under the curve [AUC] = 1.000) and ENST00000607464.1 (AUC = 1.000) were the best lncRNAs in distinguishing TB patients from HCs.ConclusionThis study characterized the lncRNA profiles of TB patients and identified a significant module associated with TB as well as novel potential biomarkers for TB diagnosis.

Keywords