EPJ Web of Conferences (Jan 2020)

EuroEXA Custom Switch: an innovative FPGA-based system for extreme scale computing in Europe

  • Biagioni Andrea,
  • Cretaro Paolo,
  • Frezza Ottorino,
  • Lo Cicero Francesca,
  • Lonardo Alessandro,
  • Paolucci Pier Stanislao,
  • Pontisso Luca,
  • Simula Francesco,
  • Vicini Piero

DOI
https://doi.org/10.1051/epjconf/202024509004
Journal volume & issue
Vol. 245
p. 09004

Abstract

EuroEXA is a major European FET research initiative that aims to deliver a proof of concept of a next-generation Exa-scalable HPC platform. EuroEXA builds on the results of previous projects (ExaNeSt, ExaNoDe and ECOSCALE) to design a medium-scale but scalable, fully working HPC system prototype exploiting state-of-the-art FPGA devices that integrate compute accelerators and a low-latency, high-throughput network. Exascale-class systems are expected to host a very large number of computing nodes, from 10^4 up to 10^5, so the capability and performance of the interconnect architecture are critical to achieving high computing efficiency at this scale. In this perspective, EuroEXA enhances the ExaNet architecture, inherited from the ExaNeSt project, and introduces a multi-tier, hybrid-topology network built on top of an FPGA-integrated Custom Switch that provides high throughput and low inter-node traffic latency for the different layers of the network hierarchy. The deployment of a few testbeds of incremental complexity is planned, each equipped with a complete software stack and runtime environment, to support the integration and testing of the network design and to allow evaluation of system performance and scalability through benchmarks based on real HPC applications. Design and integration activities are ongoing; the first small-scale prototype (50 nodes) is expected to be completed in fall 2020, followed one year later by the deployment of the larger prototype (250/500 nodes).