BMC Pulmonary Medicine (Apr 2023)

Identifying chronic obstructive pulmonary disease from integrative omics and clustering in lung tissue

  • Brian D Hobbs,
  • Jarrett D Morrow,
  • Xu-Wen Wang,
  • Yang-Yu Liu,
  • Dawn L DeMeo,
  • Craig P Hersh,
  • Bartolome R Celli,
  • Raphael Bueno,
  • Gerard J Criner,
  • Edwin K Silverman,
  • Michael H Cho

DOI
https://doi.org/10.1186/s12890-023-02389-5
Journal volume & issue
Vol. 23, no. 1
pp. 1 – 12

Abstract

Read online

Abstract Background Chronic obstructive pulmonary disease (COPD) is a highly morbid and heterogenous disease. While COPD is defined by spirometry, many COPD characteristics are seen in cigarette smokers with normal spirometry. The extent to which COPD and COPD heterogeneity is captured in omics of lung tissue is not known. Methods We clustered gene expression and methylation data in 78 lung tissue samples from former smokers with normal lung function or severe COPD. We applied two integrative omics clustering methods: (1) Similarity Network Fusion (SNF) and (2) Entropy-Based Consensus Clustering (ECC). Results SNF clusters were not significantly different by the percentage of COPD cases (48.8% vs. 68.6%, p = 0.13), though were different according to median forced expiratory volume in one second (FEV1) % predicted (82 vs. 31, p = 0.017). In contrast, the ECC clusters showed stronger evidence of separation by COPD case status (48.2% vs. 81.8%, p = 0.013) and similar stratification by median FEV1% predicted (82 vs. 30.5, p = 0.0059). ECC clusters using both gene expression and methylation were identical to the ECC clustering solution generated using methylation data alone. Both methods selected clusters with differentially expressed transcripts enriched for interleukin signaling and immunoregulatory interactions between lymphoid and non-lymphoid cells. Conclusions Unsupervised clustering analysis from integrated gene expression and methylation data in lung tissue resulted in clusters with modest concordance with COPD, though were enriched in pathways potentially contributing to COPD-related pathology and heterogeneity.

Keywords