Machine-learning-based particle identification with missing data

Miłosz Kasak; Kamil Deja; Maja Karwowska; Monika Jakubowska; Łukasz Graczykowski; Małgorzata Janik

doi:10.1140/epjc/s10052-024-13047-3

European Physical Journal C: Particles and Fields (Jul 2024)

Affiliations

Miłosz Kasak: Warsaw University of Technology
Kamil Deja: Warsaw University of Technology
Maja Karwowska: Warsaw University of Technology
Monika Jakubowska: Warsaw University of Technology
Łukasz Graczykowski: Warsaw University of Technology
Małgorzata Janik: Warsaw University of Technology

DOI: https://doi.org/10.1140/epjc/s10052-024-13047-3
Journal volume & issue: Vol. 84, no. 7
pp. 1 – 11

Abstract

In this work, we introduce a novel method for Particle Identification (PID) within the scope of the ALICE experiment at the Large Hadron Collider at CERN. Identifying products of ultrarelativistic collisions delivered by the LHC is one of the crucial objectives of ALICE. Typically employed PID methods rely on hand-crafted selections, which compare experimental data to theoretical simulations. To improve the performance of the baseline methods, novel approaches use machine learning models that learn the proper assignment in a classification task. However, because of the various detection techniques used by different subdetectors, as well as the limited detector efficiency and acceptance, produced particles do not always yield signals in all of the ALICE components. This results in data with missing values. Out-of-the-box machine learning solutions cannot be trained with such examples without either modifying the training dataset or redesigning the model architecture. In this work, we propose a new method for PID that addresses these issues and can be trained with all of the available data examples, including incomplete ones. Our approach improves the PID purity and efficiency of the selected sample for all investigated particle species.

Published in European Physical Journal C: Particles and Fields

ISSN: 1434-6044 (Print); 1434-6052 (Online)
Publisher: SpringerOpen
Country of publisher: Germany
LCC subjects: Science: Astronomy: Astrophysics; Science: Physics: Nuclear and particle physics. Atomic energy. Radioactivity
Website: http://www.springer.com/physics/particle+and+nuclear+physics/journal/10052
