Journal of Environmental Engineering and Landscape Management (Nov 2024)

Statistical and machine learning approaches for estimating pollution of fine particulate matter (PM2.5) in Vietnam

  • Tuyet Nam Thi Nguyen,
  • Tan Dat Trinh,
  • Pham Cung Le Thien Vu,
  • Pham The Bao

DOI
https://doi.org/10.3846/jeelm.2024.22361
Journal volume & issue
Vol. 32, no. 4

Abstract

Read online

This study aims to predict fine particulate matter (PM2.5) pollution in Ho Chi Minh City, Vietnam, using autoregressive integrated moving average (ARIMA), linear regression (LR), random forest (RF), long short-term memory (LSTM), bidirectional LSTM (Bi-LSTM), and convolutional neural network (CNN) combining Bi-LSTM (CNN+Bi-LSTM). Two experiments were set up: the first one used data from 2018–2020 and 2021 as training and test data, respectively. Data from 2018–2021 and 2022 were used as training and test data for the second experiment, respectively. Consequently, ARIMA showed the worst performance, while CNN+Bi-LSTM achieved the best accuracy, with an R² of 0.70 and MAE, MSE, RMSE, and MAPE of 5.37, 65.4, 8.08 µg/m³, and 29%, respectively. Additionally, predicted air quality indexes (AQIs) of PM2.5 were matched the observed ones up to 96%, reflecting the application of predicted concentrations for AQI computation. Our study highlights the effectiveness of machine learning model in monitoring of air pollution.

Keywords