EPJ Web of Conferences (Jan 2020)

Extension of the INFN Tier-1 on a HPC system

  • Boccali Tommaso,
  • Dal Pra Stefano,
  • Spiga Daniele,
  • Ciangottini Diego,
  • Zani Stefano,
  • Bozzi Concezio,
  • De Salvo Alessandro,
  • Valassi Andrea,
  • Noferini Francesco,
  • dell’Agnello Luca,
  • Stagni Federico,
  • Doria Alessandra,
  • Bonacorsi Daniele

DOI
https://doi.org/10.1051/epjconf/202024509009
Journal volume & issue
Vol. 245
p. 09009

Abstract

The INFN Tier-1, located at CNAF in Bologna (Italy), is a center of the WLCG e-Infrastructure, supporting the four major LHC collaborations and more than 30 other INFN-related experiments. After multiple tests of elastic expansion of CNAF compute power via Cloud resources (provided by Azure, Aruba, and in the framework of the HNSciCloud project), and building on the experience gained with the production-quality extension of the Tier-1 farm on remote owned sites, the CNAF team, in collaboration with experts from the ALICE, ATLAS, CMS, and LHCb experiments, has been working to put into production an integrated HTC+HPC solution with the PRACE CINECA center, located near Bologna. This extension will be implemented on the Marconi A2 partition, equipped with Intel Knights Landing (KNL) processors. A number of technical challenges were faced and solved in order to run successfully on low-RAM nodes and to overcome the closed environment (network, access, software distribution, …) that HPC systems impose compared with standard GRID sites. We show preliminary results from a large-scale integration effort, using resources secured via the successful PRACE grant No. 2018194658, for 30 million KNL core hours.