BMC Pulmonary Medicine (May 2022)

Early prediction of moderate-to-severe condition of inhalation-induced acute respiratory distress syndrome via interpretable machine learning

  • Junwei Wu,
  • Chao Liu,
  • Lixin Xie,
  • Xiang Li,
  • Kun Xiao,
  • Guotong Xie,
  • Fei Xie

DOI
https://doi.org/10.1186/s12890-022-01963-7
Journal volume & issue
Vol. 22, no. 1
pp. 1 – 9

Abstract

Read online

Abstract Background Several studies have investigated the correlation between physiological parameters and the risk of acute respiratory distress syndrome (ARDS), in addition, etiology-associated heterogeneity in ARDS has become an emerging topic quite recently; however, the intersection between the two, which is early prediction of target conditions in etiology-specific ARDS, has not been well-studied. We aimed to develop and validate a machine-learning model for the early prediction of moderate-to-severe condition of inhalation-induced ARDS. Methods Clinical expertise was applied with data-driven analysis. Using data from electronic intensive care units (retrospective derivation cohort) and the three most accessible vital signs (i.e. heart rate, temperature, and respiratory rate) together with feature engineering, we applied a random forest approach during the time window of 90 h that ended 6 h prior to the onset of moderate-to-severe respiratory failure (the ratio of partial pressure of arterial oxygen to fraction of inspired oxygen ≤ 200 mmHg). Results The trained random forest classifier was validated using two independent validation cohorts, with an area under the curve of 0.9127 (95% confidence interval 0.8713–0.9542) and 0.9026 (95% confidence interval 0.8075–1), respectively. A Stable and Interpretable RUle Set (SIRUS) was used to extract rules from the RF to provide guidelines for clinicians. We identified several predictive factors, including resp_96h_6h_min 100, that could be used for predicting inhalation-induced ARDS (moderate-to-severe condition) 6 h prior to onset in critical care units. (‘xxx_96h_6h_min/mean/max’: the minimum/mean/maximum values of the xxx vital sign collected during a 90 h time window beginning 96 h prior to the onset of ARDS and ending 6 h prior to the onset from every recorded blood gas test). Conclusions This newly established random forest‑based interpretable model shows good predictive ability for moderate-to-severe inhalation-induced ARDS and may assist clinicians in decision-making, as well as facilitate the enrolment of patients in prevention programmes to improve their outcomes.

Keywords