Machine Learning: Science and Technology (Jan 2023)

Fast kernel methods for data quality monitoring as a goodness-of-fit test

  • Gaia Grosso,
  • Nicolò Lai,
  • Marco Letizia,
  • Jacopo Pazzini,
  • Marco Rando,
  • Lorenzo Rosasco,
  • Andrea Wulzer,
  • Marco Zanetti

DOI
https://doi.org/10.1088/2632-2153/acebb7
Journal volume & issue
Vol. 4, no. 3
p. 035029

Abstract

Read online

We propose an accurate and efficient machine learning approach for monitoring particle detectors in real-time. The goal is to assess the compatibility of incoming experimental data with a reference dataset, characterising the data behaviour under normal circumstances, via a likelihood-ratio hypothesis test. The model is based on a modern implementation of kernel methods, nonparametric algorithms that can learn any continuous function given enough data. The resulting approach is efficient and agnostic to the type of anomaly that may be present in the data. Our study demonstrates the effectiveness of this strategy on multivariate data from drift tube chamber muon detectors.

Keywords