Journal of High Energy Physics (Feb 2024)

Anomaly detection in the presence of irrelevant features

  • Marat Freytsis,
  • Maxim Perelstein,
  • Yik Chuen San

DOI
https://doi.org/10.1007/JHEP02(2024)220
Journal volume & issue
Vol. 2024, no. 2
pp. 1 – 22

Abstract

Read online

Abstract Experiments at particle colliders are the primary source of insight into physics at microscopic scales. Searches at these facilities often rely on optimization of analyses targeting specific models of new physics. Increasingly, however, data-driven model-agnostic approaches based on machine learning are also being explored. A major challenge is that such methods can be highly sensitive to the presence of many irrelevant features in the data. This paper presents Boosted Decision Tree (BDT)-based techniques to improve anomaly detection in the presence of many irrelevant features. First, a BDT classifier is shown to be more robust than neural networks for the Classification Without Labels approach to finding resonant excesses assuming independence of resonant and non-resonant observables. Next, a tree-based probability density estimator using copula transformations demonstrates significant stability and improved performance over normalizing flows as irrelevant features are added. The results make a compelling case for further development of tree-based algorithms for more robust resonant anomaly detection in high energy physics.

Keywords