EPJ Web of Conferences (Jan 2021)

Prototype of the Russian Scientific Data Lake

  • Alekseev Aleksandr,
  • Espinal Xavier,
  • Jezequel Stephane,
  • Kiryanov Andrey,
  • Klimentov Alexei,
  • Korchuganova Tatiana,
  • Mitsyn Valeri,
  • Oleynik Danila,
  • Smirnov Alexander,
  • Smirnov Sergei,
  • Zarochentsev Andrey

DOI
https://doi.org/10.1051/epjconf/202125102031
Journal volume & issue
Vol. 251
p. 02031

Abstract

The High Luminosity phase of the LHC, which aims for a tenfold increase in the luminosity of proton-proton collisions, is expected to start operation in eight years. It will deliver an unprecedented volume of scientific data, at the multi-exabyte scale, to the particle physics experiments at CERN. This data has to be stored, and the corresponding technology must ensure fast and reliable data delivery for processing by the scientific community all over the world. The present LHC computing model will not be able to provide the required infrastructure growth, even taking into account the expected hardware evolution. To address this challenge, the Data Lake R&D project was launched by the DOMA community in the fall of 2019. State-of-the-art data handling technologies are under active development, and their current status for the Russian Scientific Data Lake prototype is presented here.