Prediction of atmospheric PM2.5 level by machine learning techniques in Isfahan, Iran

Farzaneh Mohammadi; Hakimeh Teiri; Yaghoub Hajizadeh; Ali Abdolahnejad; Afshin Ebrahimi

doi:10.1038/s41598-024-52617-z

Scientific Reports (Jan 2024)

Affiliations

Farzaneh Mohammadi: Department of Environmental Health Engineering, Faculty of Health, Isfahan University of Medical Sciences
Hakimeh Teiri: Department of Environmental Health Engineering, Faculty of Health, Isfahan University of Medical Sciences
Yaghoub Hajizadeh: Environment Research Center, Research Institute for Primordial Prevention of Non-Communicable Diseases, Isfahan University of Medical Sciences
Ali Abdolahnejad: Department of Environmental Health Engineering, School of Public Health, Maragheh University of Medical Sciences
Afshin Ebrahimi: Environment Research Center, Research Institute for Primordial Prevention of Non-Communicable Diseases, Isfahan University of Medical Sciences

Journal volume & issue: Vol. 14, no. 1
pp. 1 – 12

Abstract

With air pollution levels rising, air quality prediction has attracted growing attention, and researchers are developing mathematical models to achieve precise predictions. Monitoring and predicting atmospheric PM2.5, a predominant pollutant, is essential to emission mitigation programs. In this study, meteorological datasets covering 9 years in Isfahan, a large metropolis in Iran, were used to predict PM2.5 levels with four machine learning algorithms: Artificial Neural Networks (ANNs), K-Nearest Neighbors (KNN), Support Vector Machines (SVMs), and Random Forest (RF), an ensemble of classification trees. Data from 7 air quality monitoring stations located in the city were taken into consideration. The confusion matrix and cross-entropy loss were used to analyze the performance of the classification models, and several metrics were computed to assess them: sensitivity, specificity, accuracy, F1 score, precision, and the area under the curve (AUC). Finally, the predicted data for 2020 were imported into ArcGIS and interpolated over the area of Isfahan city with the Inverse Distance Weighting (IDW) method, producing a pollution map for each month of the year. Based on accuracy, the ANN model performed best (90.1%) in predicting PM2.5 grades for the applied meteorological dataset, followed by the RF (86.1%), SVM (84.6%), and KNN (82.2%) models. ANN modelling therefore provides a feasible procedure for the managerial planning of air pollution control.
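
As an illustration of two techniques named in the abstract, the following minimal Python sketch computes confusion-matrix metrics for one PM2.5 grade treated one-vs-rest, and performs a basic IDW interpolation at a single point. It is not the authors' code: the function names, the grade labels, the station coordinates, and the power parameter of 2 are all assumptions made for the example, and the paper's ArcGIS workflow may use different settings.

```python
from math import hypot


def one_vs_rest_metrics(y_true, y_pred, positive):
    """Confusion-matrix metrics with one class treated as 'positive'.

    Note: 'accuracy' here is the one-vs-rest accuracy for that class,
    not the overall multi-class accuracy.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = len(pairs) - tp - fp - fn

    # Guard every ratio against an empty denominator.
    sensitivity = tp / (tp + fn) if tp + fn else 0.0  # recall
    specificity = tn / (tn + fp) if tn + fp else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    f1 = (2 * precision * sensitivity / (precision + sensitivity)
          if precision + sensitivity else 0.0)
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "f1": f1,
        "accuracy": accuracy,
    }


def idw(stations, x, y, power=2.0):
    """Inverse Distance Weighting estimate at (x, y).

    stations: iterable of (station_x, station_y, value) tuples.
    power: distance exponent (assumed 2 here; not stated in the abstract).
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for sx, sy, value in stations:
        distance = hypot(x - sx, y - sy)
        if distance == 0.0:
            # The query point coincides with a station: return its value.
            return value
        weight = 1.0 / distance ** power
        weighted_sum += weight * value
        weight_total += weight
    return weighted_sum / weight_total


# Hypothetical labels and station readings, for illustration only.
observed = ["good", "good", "unhealthy", "unhealthy"]
predicted = ["good", "unhealthy", "unhealthy", "unhealthy"]
metrics = one_vs_rest_metrics(observed, predicted, positive="unhealthy")

readings = [(0.0, 0.0, 10.0), (2.0, 0.0, 30.0)]
midpoint_estimate = idw(readings, 1.0, 0.0)
```

Repeating the `idw` call over a grid of points covering the city would yield a surface comparable in spirit to the monthly pollution maps, although the map resolution and search neighbourhood are not given in the abstract.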

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/
