EPJ Web of Conferences (Jan 2019)

Improving WLCG Networks Through Monitoring and Analytics

  • Babik Marian,
  • McKee Shawn,
  • Bockelman Brian Paul,
  • Fajardo Hernandez Edgar Mauricio,
  • Martelli Edoardo,
  • Vukotic Ilija,
  • Weitzel Derek,
  • Zvada Marian

DOI
https://doi.org/10.1051/epjconf/201921408006
Journal volume & issue
Vol. 214
p. 08006

Abstract

WLCG relies on the network as a critical part of its infrastructure and therefore needs to guarantee effective network usage and prompt detection and resolution of any network issues, including connection failures, congestion and traffic routing. The OSG Networking Area, in partnership with WLCG, has focused on collecting, storing and making available all network-related metrics for further analysis and discovery of issues that might impact network performance and operations. To help sites and experiments better understand and fix networking issues, the WLCG Network Throughput working group was formed; it works on the analysis and integration of the network-related monitoring data collected by the OSG/WLCG infrastructure and operates a support unit to help find and fix network performance issues. This paper describes the current state of the OSG network measurement platform and summarises the activities undertaken by the working group, including updates on recently developed higher-level services, the network performance incidents investigated, and past and present analytical activities related to networking and their results.