Scientific Reports (Mar 2021)

Latent traits of lung tissue patterns in former smokers derived by dual channel deep learning in computed tomography images

  • Frank Li,
  • Jiwoong Choi,
  • Chunrui Zou,
  • John D. Newell,
  • Alejandro P. Comellas,
  • Chang Hyun Lee,
  • Hongseok Ko,
  • R. Graham Barr,
  • Eugene R. Bleecker,
  • Christopher B. Cooper,
  • Fereidoun Abtin,
  • Igor Barjaktarevic,
  • David Couper,
  • MeiLan Han,
  • Nadia N. Hansel,
  • Richard E. Kanner,
  • Robert Paine,
  • Ella A. Kazerooni,
  • Fernando J. Martinez,
  • Wanda O’Neal,
  • Stephen I. Rennard,
  • Benjamin M. Smith,
  • Prescott G. Woodruff,
  • Eric A. Hoffman,
  • Ching-Long Lin

DOI
https://doi.org/10.1038/s41598-021-84547-5
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Chronic obstructive pulmonary disease (COPD) is a heterogeneous disease and the traditional variables extracted from computed tomography (CT) images may not be sufficient to describe all the topological features of lung tissues in COPD patients. We employed an unsupervised three-dimensional (3D) convolutional autoencoder (CAE)-feature constructor (FC) deep learning network to learn from CT data and derive tissue pattern-clusters jointly. We then applied exploratory factor analysis (EFA) to discover the unobserved latent traits (factors) among pattern-clusters. CT images at total lung capacity (TLC) and residual volume (RV) of 541 former smokers and 59 healthy non-smokers from the cohort of the SubPopulations and Intermediate Outcome Measures in the COPD Study (SPIROMICS) were analyzed. TLC and RV images were registered to calculate the Jacobian (determinant) values for all the voxels in TLC images. 3D Regions of interest (ROIs) with two data channels of CT intensity and Jacobian value were randomly extracted from training images and were fed to the 3D CAE-FC model. 80 pattern-clusters and 7 factors were identified. Factor scores computed for individual subjects were able to predict spirometry-measured pulmonary functions. Two factors which correlated with various emphysema subtypes, parametric response mapping (PRM) metrics, airway variants, and airway tree to lung volume ratio were discriminants of patients across all severity stages. Our findings suggest the potential of developing factor-based surrogate markers for new COPD phenotypes.