Atmosphere (Jul 2022)

An Intelligent Time Series Model Based on Hybrid Methodology for Forecasting Concentrations of Significant Air Pollutants

  • Ching-Hsue Cheng,
  • Ming-Chi Tsai

DOI
https://doi.org/10.3390/atmos13071055
Journal volume & issue
Vol. 13, no. 7
p. 1055

Abstract

Read online

Rapid industrialization and urban development are the main causes of air pollution, leading to daily air quality and health problems. To find significant pollutants and forecast their concentrations, in this study, we used a hybrid methodology, including integrated variable selection, autoregressive distributed lag, and deleted multiple collinear variables to reduce variables, and then applied six intelligent time series models to forecast the concentrations of the top three pollution sources. We collected two air quality datasets from traffic and industrial monitoring stations and weather data to analyze and compare their results. The results show that a random forest based on selected key variables has better classification metrics (accuracy, AUC, recall, precision, and F1). After deleting the collinearity of the independent variables and adding the lag periods using the autoregressive distributed lag model, the intelligent time-series support vector regression was found to have better forecasting performance (RMSE and MAE). Finally, the research results could be used as a reference by all relevant stakeholders and help respond to poor air quality.

Keywords