Machine Learning: Science and Technology (2023)

FAIR AI models in high energy physics

  • Javier Duarte,
  • Haoyang Li,
  • Avik Roy,
  • Ruike Zhu,
  • E A Huerta,
  • Daniel Diaz,
  • Philip Harris,
  • Raghav Kansal,
  • Daniel S Katz,
  • Ishaan H Kavoori,
  • Volodymyr V Kindratenko,
  • Farouk Mokhtar,
  • Mark S Neubauer,
  • Sang Eon Park,
  • Melissa Quinnan,
  • Roger Rusack,
  • Zhizhen Zhao

DOI: https://doi.org/10.1088/2632-2153/ad12e3
Journal volume & issue: Vol. 4, no. 4, p. 045062

Abstract

The findable, accessible, interoperable, and reusable (FAIR) data principles provide a framework for examining, evaluating, and improving how data is shared to facilitate scientific discovery. Generalizing these principles to research software and other digital products is an active area of research. Machine learning models—algorithms that have been trained on data without being explicitly programmed—and more generally, artificial intelligence (AI) models, are an important target for this because of the ever-increasing pace with which AI is transforming scientific domains, such as experimental high energy physics (HEP). In this paper, we propose a practical definition of FAIR principles for AI models in HEP and describe a template for the application of these principles. We demonstrate the template’s use with an example AI model applied to HEP, in which a graph neural network is used to identify Higgs bosons decaying to two bottom quarks. We report on the robustness of this FAIR AI model, its portability across hardware architectures and software frameworks, and its interpretability.
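
As a rough illustration of the kind of model the abstract refers to, the sketch below defines a minimal permutation-invariant jet classifier in PyTorch that maps a jet's particle constituents to a Higgs-versus-background score. It is a simplified stand-in, not the graph neural network released with the paper: the class name TinyJetTagger, the input feature count, and the layer sizes are illustrative assumptions, and the commented ONNX export line only gestures at the framework portability the abstract mentions.

# Hypothetical sketch: a minimal particle-cloud jet classifier in PyTorch.
# This is NOT the model described in the paper; it only illustrates the kind
# of AI model (jet constituents -> H(bb)-vs-background score) that a FAIR
# template would document, version, and share.
import torch
import torch.nn as nn

class TinyJetTagger(nn.Module):
    """Permutation-invariant classifier over a jet's particle constituents."""
    def __init__(self, n_features: int = 4, hidden: int = 64):
        super().__init__()
        # Per-particle encoder, weights shared across all constituents
        self.phi = nn.Sequential(nn.Linear(n_features, hidden), nn.ReLU(),
                                 nn.Linear(hidden, hidden), nn.ReLU())
        # Jet-level head applied to the aggregated particle embeddings
        self.rho = nn.Sequential(nn.Linear(hidden, hidden), nn.ReLU(),
                                 nn.Linear(hidden, 1))

    def forward(self, particles: torch.Tensor) -> torch.Tensor:
        # particles: (batch, n_particles, n_features), zero-padded
        h = self.phi(particles)            # encode each constituent
        jet = h.sum(dim=1)                 # permutation-invariant aggregation
        return self.rho(jet).squeeze(-1)   # logit for signal vs. background

if __name__ == "__main__":
    model = TinyJetTagger()
    jets = torch.randn(8, 30, 4)                 # 8 toy jets, 30 constituents each
    print(torch.sigmoid(model(jets)).shape)      # torch.Size([8])
    # torch.onnx.export(model, (jets,), "tiny_jet_tagger.onnx")  # framework-portable export

The deliberately generic design (shared per-particle encoder plus a symmetric aggregation) is what makes such a model easy to re-run and inspect; the paper's FAIR template concerns how a real model of this type is packaged, documented, and shared rather than its architecture.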

Keywords