Ecotoxicology and Environmental Safety (Jan 2024)

Development of interpretable machine learning models associated with environmental chemicals to predict all-cause and specific-cause mortality:A longitudinal study based on NHANES

  • Siyu Duan,
  • Yafei Wu,
  • Junmin Zhu,
  • Xing Wang,
  • Yaheng Zhang,
  • Chenming Gu,
  • Ya Fang

Journal volume & issue
Vol. 270
p. 115864

Abstract

Read online

Limited information is available on potential predictive value of environmental chemicals for mortality. Our study aimed to investigate the associations between 43 of 8 classes representative environmental chemicals in serum/urine and mortality, and further develop the interpretable machine learning models associated with environmental chemicals to predict mortality. A total of 1602 participants were included from the National Health and Nutrition Examination Survey (NHANES). During 154,646 person-months of follow-up, 127 deaths occurred. We found that machine learning showed promise in predicting mortality. CoxPH was selected as the optimal model for predicting all-cause mortality with time-dependent AUROC of 0.953 (95%CI: 0.951–0.955). Coxnet was the best model for predicting cardiovascular disease (CVD) and cancer mortality with time-dependent AUROCs of 0.935 (95%CI: 0.933–0.936) and 0.850 (95%CI: 0.844–0.857). Based on clinical variables, adding environmental chemicals could enhance the predictive ability of cancer mortality (P < 0.05). Some environmental chemicals contributed more to the models than traditional clinical variables. Combined the results of association and prediction models by interpretable machine learning analyses, we found urinary methyl paraben (MP) and urinary 2-napthol (2-NAP) were negatively associated with all-cause mortality, while serum cadmium (Cd) was positively associated with all-cause mortality. Urinary bisphenol A (BPA) was positively associated with CVD mortality.

Keywords