Faster RooFitting: Automated parallel calculation of collaborative statistical models

Bos E G Patrick; Burgard Carsten D; Croft Vincent A.; Hageboeck Stephan; Moneta Lorenzo; Pelupessy Inti; Attema Jisk J; Verkerke Wouter

doi:10.1051/epjconf/202024506027

EPJ Web of Conferences (Jan 2020)

Affiliations

Bos E G Patrick: Netherlands eScience Center
Burgard Carsten D: ATLAS group, Nikhef
Croft Vincent A.: Dept. of Physics and Astronomy, Tufts University
Hageboeck Stephan: ROOT Development Team, CERN
Moneta Lorenzo: ROOT Development Team, CERN
Pelupessy Inti: Netherlands eScience Center
Attema Jisk J: Netherlands eScience Center
Verkerke Wouter: ATLAS group, Nikhef

DOI: https://doi.org/10.1051/epjconf/202024506027
Journal volume & issue: Vol. 245, p. 06027

Abstract

RooFit [1, 2] is the main statistical modeling and fitting package used to extract physical parameters from reduced particle collision data, e.g. in the Higgs boson experiments at the LHC [3, 4]. RooFit aims to separate particle physics model building and fitting (the users' goals) from their technical implementation and optimization in the back-end. In this paper, we outline our efforts to further optimize this back-end by automatically running parts of user models in parallel on multi-core machines. A major challenge is that RooFit allows users to define many different types of models, each with different computational bottlenecks. Our automatic parallelization framework must therefore be flexible, while still reducing run time by at least an order of magnitude, preferably more. We have performed extensive benchmarks and identified at least three bottlenecks that will benefit from parallelization. We designed a parallelization framework that parallelizes likelihood minimization with high performance by splitting the work over the partial derivatives computed in the minimizer. The framework is built on a task queue approach. Preliminary results show speed-ups of a factor of 2 to 20, depending on the exact model and parallelization strategy.
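
The idea of splitting a likelihood gradient over partial derivatives and handing them to a task queue can be illustrated with a minimal Python sketch. This is not RooFit code and not the authors' implementation: the toy Gaussian model, the data values, and the names `nll`, `partial_derivative` and `parallel_gradient` are all hypothetical, and a thread pool from the standard library stands in for the task queue (a real implementation would use worker processes to gain actual speed-up).

```python
import math
from concurrent.futures import ThreadPoolExecutor

# Hypothetical toy data set; not taken from the paper.
DATA = [4.8, 5.1, 5.3, 4.9, 5.0, 5.4, 4.7]


def nll(params):
    """Negative log-likelihood of a Gaussian model, params = [mean, sigma]."""
    mean, sigma = params
    return sum(
        0.5 * ((x - mean) / sigma) ** 2
        + math.log(sigma)
        + 0.5 * math.log(2.0 * math.pi)
        for x in DATA
    )


def partial_derivative(func, params, index, step=1e-6):
    """Central finite difference of func with respect to params[index]."""
    up = list(params)
    down = list(params)
    up[index] += step
    down[index] -= step
    return (func(up) - func(down)) / (2.0 * step)


def parallel_gradient(func, params, workers=2):
    """Submit one task per partial derivative; workers pull tasks from the pool's queue."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        tasks = [
            pool.submit(partial_derivative, func, params, i)
            for i in range(len(params))
        ]
        # Collect results in parameter order, regardless of completion order.
        return [task.result() for task in tasks]


gradient = parallel_gradient(nll, [5.0, 0.5])
```

For the assumed data and the point (mean = 5.0, sigma = 0.5), the analytic gradient is (-0.8, 10.8), which the finite-difference result should reproduce closely. Because each partial derivative is an independent task, the number of usable workers in this scheme grows with the number of model parameters.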

Published in EPJ Web of Conferences

ISSN: 2100-014X (Online)
Publisher: EDP Sciences
Country of publisher: France
LCC subjects: Science: Physics
Website: http://www.epj-conferences.org/
