PLoS ONE (Jan 2024)

Computational reassessment of RNA-seq data reveals key genes in active tuberculosis.

  • Rakesh Arya,
  • Hemlata Shakya,
  • Reetika Chaurasia,
  • Surendra Kumar,
  • Joseph M Vinetz,
  • Jong Joo Kim

DOI
https://doi.org/10.1371/journal.pone.0305582
Journal volume & issue
Vol. 19, no. 6
p. e0305582

Abstract

Read online

BackgroundTuberculosis is a serious life-threatening disease among the top global health challenges and rapid and effective diagnostic biomarkers are vital for early diagnosis especially given the increasing prevalence of multidrug resistance.MethodsTwo human whole blood microarray datasets, GSE42826 and GSE42830 were retrieved from publicly available gene expression omnibus (GEO) database. Deregulated genes (DEGs) were identified using GEO2R online tool and Gene Ontology (GO), protein-protein interaction (PPI) network analysis was performed using Metascape and STRING databases. Significant genes (n = 8) were identified using T-test/ANOVA and Molecular Complex Detection (MCODE) score ≥10, which was validated in GSE34608 dataset. The diagnostic potential of three biomarkers was assessed using Area Under Curve (AUC) of Receiver Operating Characteristic (ROC) plot. The transcriptional levels of these genes were also examined in a separate dataset GSE31348, to monitor the patterns of variation during tuberculosis treatment.ResultsA total of 62 common DEGs (57 upregulated, 7 downregulated genes) were identified in two discovery datasets. GO functions and pathway enrichment analysis shed light on the functional roles of these DEGs in immune response and type-II interferon signaling. The genes in Module-1 (n = 18) were linked to innate immune response, interferon-gamma signaling. The common genes (n = 8) were validated in GSE34608 dataset, that corroborates the results obtained from discovery sets. The gene expression levels demonstrated responsiveness to Mtb infection during anti-TB therapy in GSE31348 dataset. In GSE34608 dataset, the expression levels of three specific genes, GBP5, IFITM3, and EPSTI1, emerged as potential diagnostic makers. In combination, these genes scored remarkable diagnostic performance with 100% sensitivity and 89% specificity, resulting in an impressive Area Under Curve (AUC) of 0.958. However, GBP5 alone showed the highest AUC of 0.986 with 100% sensitivity and 89% specificity.ConclusionsThe study presents valuable insights into the critical gene network perturbed during tuberculosis. These genes are determinants for assessing the effectiveness of an anti-TB response and distinguishing between active TB and healthy individuals. GBP5, IFITM3 and EPSTI1 emerged as candidate core genes in TB and holds potential as novel molecular targets for the development of interventions in the treatment of TB.