Energies (May 2024)
Regression Modeling of Daily PM<sub>2.5</sub> Concentrations with a Multilayer Perceptron
Abstract
Various types of energetic fuel combustion processes emit dangerous pollutants into the air, including aerosol particles, marked as PM10. Routine air quality monitoring includes determining the PM10 concentration as one of the basic measurements. At some air monitoring stations, the PM10 measurement is supplemented by the simultaneous determination of the concentration of PM2.5 as a finer fraction of suspended particles. Since the PM2.5 fraction has a significant share in the PM10 fraction, the concentrations of both types of particles should be strongly correlated, and the concentrations of one of these fractions can be used to model the concentrations of the other fraction. The aim of the study was to assess the error of predicting PM2.5 concentration using PM10 concentration as the main predictor. The analyzed daily concentrations were measured at 11 different monitoring stations in Poland and covered the period 2010–2021. MLP (multilayer perceptron) artificial neural networks were used to approximate the daily PM2.5 concentrations. PM10 concentrations and time variables were tested as predictors in neural networks. Several different prediction errors were taken as measures of modeling quality. Depending on the monitoring station, in models with one PM10 predictor, the RMSE error values were in the range of 2.31–6.86 μg/m3. After taking into account the second predictor D (date), the corresponding RMSE errors were lower and were in the range of 2.06–5.54 μg/m3. Our research aimed to find models that were as simple and universal as possible. In our models, the main predictor is the PM10 concentration; therefore, the only condition to be met is monitoring the measurement of PM10 concentrations. We showed that models trained at other air monitoring stations, so-called foreign models, can be successfully used to approximate PM2.5 concentrations at another station.
Keywords