Remote Sensing (Aug 2020)

A Framework to Predict High-Resolution Spatiotemporal PM<sub>2.5</sub> Distributions Using a Deep-Learning Model: A Case Study of Shijiazhuang, China

  • Guangyuan Zhang,
  • Haiyue Lu,
  • Jin Dong,
  • Stefan Poslad,
  • Runkui Li,
  • Xiaoshuai Zhang,
  • Xiaoping Rui

DOI
https://doi.org/10.3390/rs12172825
Journal volume & issue
Vol. 12, no. 17
p. 2825

Abstract

Read online

Air-borne particulate matter, PM2.5 (PM having a diameter of less than 2.5 micrometers), has aroused widespread concern and is a core indicator of severe air pollution in many cities globally. In our study, we present a validated framework to predict the daily PM2.5 distributions, exemplified by a use case of Shijiazhuang City, China, based on daily aerosol optical depth (AOD) datasets. The framework involves obtaining the high-resolution spatiotemporal AOD distributions, estimation of the spatial distributions of PM2.5 and the prediction of these based on a convolutional long short-term memory (ConvLSTM) model. In the estimation part, the eXtreme gradient boosting (XGBoost) model has been determined as the estimation model with the lowest root mean square error (RMSE) of 32.86 µg/m3 and the highest coefficient of determination regression score function (R2) of 0.71, compared to other common models used as a baseline for comparison (linear, ridge, least absolute shrinkage and selection operator (LASSO) and cubist). For the prediction part, after validation and comparison with a seasonal autoregressive integrated moving average (SARIMA), which is a traditional time-series prediction model, in both time and space, the ConvLSTM gives a more accurate performance for the prediction, with a total average prediction RMSE of 14.94 µg/m3 compared to SARIMA’s 17.41 µg/m3. Furthermore, ConvLSTM is more stable and with less fluctuations for the prediction of PM2.5 in time, and it can also eliminate better the spatial predicted errors compared to SARIMA.

Keywords