EPJ Web of Conferences (Jan 2024)

Analyzing, Identifying & Alerting on Network Issues

  • Vasileva Petya,
  • Babik Marian,
  • McKee Shawn,
  • Vukotic Ilija

DOI
https://doi.org/10.1051/epjconf/202429507003
Journal volume & issue
Vol. 295
p. 07003

Abstract

Read online

The Worldwide LHC Computing Grid (WLCG) relies on the network as a critical part of its infrastructure and therefore needs to guarantee effective network usage and prompt detection and resolution of any network issues, including connection failures, congestion, and traffic routing. In this paper, we will describe our ongoing work to proactively analyze, correlate and alert on various network and infrastructure issues. We will discuss the methods and techniques applied, the systems developed, and the challenges with the measurements that make it difficult to easily identify problems or assign those problems to the appropriate location(s).