EPJ Web of Conferences (Jan 2019)

BigPanDA: PanDA Workload Management System and its Applications beyond ATLAS

  • Svirin Pavlo,
  • De Kaushik,
  • Forti Alessandra,
  • Klimentov Alexei,
  • Larsen Rasmus,
  • Love Peter,
  • Maeno Tadashi,
  • Mashinistov Ruslan,
  • Mukherjee Swagato,
  • Nomerotski Andrei,
  • Oleynik Danila,
  • Panitkin Sergey,
  • Park Hye Yun,
  • Sheldon Erin,
  • Slosar Anze,
  • Wells Jack,
  • Wenaus Torre

DOI
https://doi.org/10.1051/epjconf/201921403050
Journal volume & issue
Vol. 214
p. 03050

Abstract

Modern experiments collect peta-scale volumes of data and utilize vast, geographically distributed computing infrastructure that serves thousands of scientists around the world. Requirements for rapid, near real-time data processing and fast analysis cycles, together with the need to run massive detector simulations to support data analysis, place a premium on efficient use of available computational resources. A sophisticated Workload Management System (WMS) is needed to coordinate the distribution and processing of data and jobs in such an environment. The ATLAS experiment at CERN uses the PanDA (Production and Data Analysis) Workload Management System to manage the workflow for all data processing at over 150 data centers. PanDA currently uses more than 250,000 cores with a peak performance of 0.3 petaFLOPS and runs around 2 million jobs per day on hundreds of Grid sites, serving thousands of ATLAS users. In 2017 about 1.5 exabytes of data were processed with PanDA. In 2012 the BigPanDA project was started with the aim of introducing new types of computing resources into the ATLAS computing infrastructure and of offering PanDA features to data-intensive applications for projects and experiments outside of ATLAS and High-Energy and Nuclear Physics. In this article we present the project's accomplishments and discuss possible directions for future work.