Intelligent Systems with Applications (Jun 2024)

Air quality monitoring using statistical learning models for sustainable environment

  • Mohsin Imam,
  • Sufiyan Adam,
  • Soumyabrata Dev,
  • Nashreen Nesa

Journal volume & issue
Vol. 22
p. 200333

Abstract

Read online

High levels of air pollutants pose significant health risks, increasing the chances of respiratory infections, lung cancer, and heart complications, particularly for those already susceptible to illness. Modern societal advancements have worsened air quality degradation, with daily activities such as transportation, industrial processes, and domestic operations releasing harmful contaminants. This study addresses the urgent need for air quality monitoring and forecasting, especially in developing nations like India, where machine learning-based prediction technologies play a crucial role in understanding environmental aspects. Our study focuses on analyzing and predicting air quality using data from two distinct areas in Kolkata—Victoria and Rabindra. Thorough pre-processing and data analysis have been conducted to identify essential features and detect undermining patterns in the data. Utilizing five classic machine learning algorithms, we predict the air quality category by categorizing the predicted AQI into six AQI classes, emphasizing extensive hyper-parameter tuning for each model. The Rabindra dataset yields the best-performing Support Vector Machine (SVC) model with a 97.98% accuracy, while the best prediction accuracy for Victoria location is 93.29% from Random Forest Classifier (RFC). This study offers valuable insights into the effectiveness of machine learning algorithms for air quality prediction, with the novelty of the study lying in the focus on hyper-parameter tuning to achieve the highest accuracy.

Keywords