Air Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model

Xuchu Jiang; Peiyao Wei; Yiwen Luo; Ying Li

doi:10.3390/atmos12111452

Atmosphere (Nov 2021)

Air Pollutant Concentration Prediction Based on a CEEMDAN-FE-BiLSTM Model

Xuchu Jiang,
Peiyao Wei,
Yiwen Luo,
Ying Li

Affiliations

Xuchu Jiang: School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan 430073, China
Peiyao Wei: School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan 430073, China
Yiwen Luo: School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan 430073, China
Ying Li: Department of Scientific Research, Zhongnan University of Economics and Law, Wuhan 430073, China

DOI: https://doi.org/10.3390/atmos12111452
Journal volume & issue: Vol. 12, no. 11
p. 1452

Abstract

Read online

The concentration series of PM2.5 (particulate matter ≤ 2.5 μm) is nonlinear, nonstationary, and noisy, making it difficult to predict accurately. This paper presents a new PM2.5 concentration prediction method based on a hybrid model of complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) and bi-directional long short-term memory (BiLSTM). The new method was applied to predict the same kind of particulate pollutant PM10 and heterogeneous gas pollutant O3, proving that the prediction method has strong generalization ability. First, CEEMDAN was used to decompose PM2.5 concentrations at different frequencies. Then, the fuzzy entropy (FE) value of each decomposed wave was calculated, and the near waves were combined by K-means clustering to generate the input sequence. Finally, the combined sequences were put into the BiLSTM model with multiple hidden layers for training. We predicted the PM2.5 concentrations of Seoul Station 116 by the hour, with values of the root mean square error (RMSE), the mean absolute error (MAE), and the symmetric mean absolute percentage error (SMAPE) as low as 2.74, 1.90, and 13.59%, respectively, and an R2 value as high as 96.34%. The “CEEMDAN-FE” decomposition-merging technology proposed in this paper can effectively reduce the instability and high volatility of the original data, overcome data noise, and significantly improve the model’s performance in predicting the real-time concentrations of PM2.5.

Published in Atmosphere

ISSN: 2073-4433 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Physics: Meteorology. Climatology
Website: http://www.mdpi.com/journal/atmosphere/

About the journal

Abstract

Keywords