EPJ Web of Conferences (Jan 2020)

CERN Analysis Preservation and Reuse Framework: FAIR research data services for LHC experiments

  • Fokianos Pamfilos,
  • Feger Sebastian,
  • Koutsakis Ilias,
  • Lavasa Artemis,
  • Maciulaitis Rokas,
  • Naim Kamran,
  • Okraska Jan,
  • Papadopoulos Antonios,
  • Rodríguez Diego,
  • Šimko Tibor,
  • Trzcinska Anna,
  • Tsanaktsidis Ioannis,
  • van de Sandt Stephanie

DOI
https://doi.org/10.1051/epjconf/202024506011
Journal volume & issue
Vol. 245
p. 06011

Abstract

Read online

In this paper we present the CERN Analysis Preservation service as a FAIR (Findable, Accessible, Interoperable and Reusable) research data preservation repository platform for LHC experiments. The CERN Analysis Preservation repository allows LHC collaborations to deposit and share the structured information about analyses as well as to capture the individual data assets associated to the analysis. We describe the typical data ingestion pipelines, through which an individual physicist can preserve and share their final n-tuples, ROOT macros, Jupyter notebooks, or even their full analysis workflow code and any intermediate datasets of interest for preservation within the restricted context of experimental collaboration. We discuss the importance of annotating the deposited content with high-level structured information about physics concepts in order to promote information discovery and knowledge sharing inside the collaboration. Finally, we describe techniques used to facilitate the reusability of preserved data assets by capturing and re-executing reproducible recipes and computational workflows using the REANA Reusable Analysis platform.