EPJ Web of Conferences (Jan 2020)

Moving the California distributed CMS XCache from bare metal into containers using Kubernetes

  • Fajardo Edgar,
  • Tadel Matevz,
  • Balcas Justas,
  • Tadel Alja,
  • Würthwein Frank,
  • Davila Diego,
  • Guiang Jonathan,
  • Sfiligoi Igor

DOI
https://doi.org/10.1051/epjconf/202024504042
Journal volume & issue
Vol. 245
p. 04042

Abstract

Read online

The University of California system maintains excellent networking between its campuses and a number of other Universities in California, including Caltech, most of them being connected at 100 Gbps. UCSD and Caltech Tier2 centers have joined their disk systems into a single logical caching system, with worker nodes from both sites accessing data from disks at either site. This successful setup has been in place for the last two years. However, coherently managing nodes at multiple physical locations is not trivial and requires an update on the operations model used. The Pacific Research Platform (PRP) provides Kubernetes resource pool spanning resources in the science demilitarized zones (DMZs) in several campuses in California and worldwide. We show how we migrated the XCache services from bare-metal deployments into containers using the PRP cluster. This paper presents the reasoning behind our hardware decisions and the experience in migrating to and operating in a mixed environment.