Machines (Mar 2022)

Optimal Classifier to Detect Unit of Measure Inconsistency in Gas Turbine Sensors

  • Lucrezia Manservigi,
  • Mauro Venturini,
  • Enzo Losi,
  • Giovanni Bechini,
  • Javier Artal de la Iglesia

DOI
https://doi.org/10.3390/machines10040228
Journal volume & issue
Vol. 10, no. 4
p. 228

Abstract

Read online

Label noise is a harmful issue that arises when data are erroneously labeled. Several label noise issues can occur but, among them, unit of measure inconsistencies (UMIs) are inexplicably neglected in the literature. Despite its relevance, a general and automated approach for UMI detection suitable to gas turbines (GTs) has not been developed yet; as a result, GT diagnosis, prognosis, and control may be challenged since collected data may not reflect the actual operation. To fill this gap, this paper investigates the capability of three supervised machine learning classifiers, i.e., Support Vector Machine, Naïve Bayes, and K-Nearest Neighbors, that are tested by means of challenging analyses to infer general guidelines for UMI detection. Classification accuracy and posterior probability of each classifier is evaluated by means of an experimental dataset derived from a large fleet of Siemens gas turbines in operation. Results reveal that Naïve Bayes is the optimal classifier for UMI detection, since 88.5% of data are correctly labeled with 84% of posterior probability when experimental UMIs affect the dataset. In addition, Naïve Bayes proved to be the most robust classifier also if the rate of UMIs increases.

Keywords