Environmental Science and Ecotechnology (May 2024)

Knowledge-guided machine learning reveals pivotal drivers for gas-to-particle conversion of atmospheric nitrate

  • Bo Xu,
  • Haofei Yu,
  • Zongbo Shi,
  • Jinxing Liu,
  • Yuting Wei,
  • Zhongcheng Zhang,
  • Yanqi Huangfu,
  • Han Xu,
  • Yue Li,
  • Linlin Zhang,
  • Yinchang Feng,
  • Guoliang Shi

Journal volume & issue
Vol. 19
p. 100333

Abstract

Read online

Particulate nitrate, a key component of fine particles, forms through the intricate gas-to-particle conversion process. This process is regulated by the gas-to-particle conversion coefficient of nitrate (ε(NO3−)). The mechanism between ε(NO3−) and its drivers is highly complex and nonlinear, and can be characterized by machine learning methods. However, conventional machine learning often yields results that lack clear physical meaning and may even contradict established physical/chemical mechanisms due to the influence of ambient factors. It urgently needs an alternative approach that possesses transparent physical interpretations and provides deeper insights into the impact of ε(NO3−). Here we introduce a supervised machine learning approach—the multilevel nested random forest guided by theory approaches. Our approach robustly identifies NH4+, SO42−, and temperature as pivotal drivers for ε(NO3−). Notably, substantial disparities exist between the outcomes of traditional random forest analysis and the anticipated actual results. Furthermore, our approach underscores the significance of NH4+ during both daytime (30%) and nighttime (40%) periods, while appropriately downplaying the influence of some less relevant drivers in comparison to conventional random forest analysis. This research underscores the transformative potential of integrating domain knowledge with machine learning in atmospheric studies.

Keywords