Smart Caching at CMS: applying AI to XCache edge services

Spiga Daniele; Ciangottini Diego; Tracolli Mirco; Tedeschi Tommaso; Cesini Daniele; Boccali Tommaso; Poggioni Valentina; Baioletti Marco; Kuznetsov Valentin Y.

doi:10.1051/epjconf/202024504024

EPJ Web of Conferences (Jan 2020)

Smart Caching at CMS: applying AI to XCache edge services

Spiga Daniele,
Ciangottini Diego,
Tracolli Mirco,
Tedeschi Tommaso,
Cesini Daniele,
Boccali Tommaso,
Poggioni Valentina,
Baioletti Marco,
Kuznetsov Valentin Y.

Affiliations

Spiga Daniele: INFN Sezione di Perugia
Ciangottini Diego: INFN Sezione di Perugia
Tracolli Mirco
Tedeschi Tommaso
Cesini Daniele: INFN-CNAF
Boccali Tommaso: INFN Sezione di Pisa
Poggioni Valentina: Università degli Studi di Perugia
Baioletti Marco: Università degli Studi di Perugia
Kuznetsov Valentin Y.: Cornell University

DOI: https://doi.org/10.1051/epjconf/202024504024
Journal volume & issue: Vol. 245
p. 04024

Abstract

Read online

The projected Storage and Compute needs for the HL-LHC will be a factor up to 10 above what can be achieved by the evolution of current technology within a flat budget. The WLCG community is studying possible technical solutions to evolve the current computing in order to cope with the requirements; one of the main focus is resource optimization, with the ultimate aim of improving performance and efficiency, as well as simplifying and reducing operation costs. As of today the storage consolidation based on a Data Lake model is considered a good candidate for addressing HL-LHC data access challenges. The Data Lake model under evaluation can be seen as a logical system that hosts a distributed working set of analysis data. Compute power can be “close” to the lake, but also remote and thus completely external. In this context we expect data caching to play a central role as a technical solution to reduce the impact of latency and reduce network load. A geographically distributed caching layer will be functional to many satellite computing centers that might appear and disappear dynamically. In this talk we propose a system of caches, distributed at national level, describing both deployment and results of the studies made to measure the impact on the CPU efficiency. In this contribution, we also present the early results on novel caching strategy beyond the standard XRootD approach whose results will be a baseline for an AI-based smart caching system.

Published in EPJ Web of Conferences

ISSN: 2100-014X (Online)
Publisher: EDP Sciences
Country of publisher: France
LCC subjects: Science: Physics
Website: http://www.epj-conferences.org/

About the journal