Applied Sciences (Dec 2022)
Clustering-Based Segmented Regression for Particulate Matter Sensor Calibration
Abstract
Nowadays, sensor-based air pollution sensing systems are widely deployed for fine-grained pollution monitoring. In-field calibration plays an important role in maintaining sensory data quality. Determining the model structure is challenging using existing methods of variable global fitting models for in-field calibration. This is because the mechanism of interference factors is complex and there is often insufficient prior knowledge on a specific sensor type. Although Artificial-Neuron-Net-based (ANN-based) methods ignore the complex conditions above, they also have problems regarding generalization, interpretability, and calculation cost. In this paper, we propose a clustering-based segmented regression method for particulate matter (PM) sensor in-field calibration. Interference from relative humidity and temperature are taken into consideration in the particulate matter concentration calibration model. Samples for modeling are divided into clusters and each cluster has an individual multiple linear regression equation. The final calibrated result of one sample is calculated from the regression model of the cluster the sample belongs to. The proposed method is evaluated under in-field deployment and performs better than a global multiple regression method both on PM2.5 and PM10 pollutants with, respectively, at least 16% and 9% improvement ratio on RMSE error. In addition, the proposed method is insensitive to reduction of training data and increase in cluster number. Moreover, it may bear lighter calculation cost, less overfitting problems and better interpretability. It can improve the efficiency and performance of post-deployment sensor calibration.
Keywords