Optimization of the Brillouin operator on the KNL architecture

Dürr Stephan

doi:10.1051/epjconf/201817502001

EPJ Web of Conferences (Jan 2018)

DOI: https://doi.org/10.1051/epjconf/201817502001
Journal volume & issue: Vol. 175, p. 02001

Abstract

Experiences with optimizing the matrix-times-vector application of the Brillouin operator on the Intel KNL processor are reported. Without adjustments to the memory layout, performance figures of 360 Gflop/s in single and 270 Gflop/s in double precision are observed. This is with N_c = 3 colors, N_v = 12 right-hand sides, N_thr = 256 threads, on lattices of size 32^3 × 64, using exclusively OMP pragmas. Interestingly, the same routine performs quite well on Intel Core i7 architectures, too. Some observations on the much harder Wilson fermion matrix-times-vector optimization problem are added.
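
To give a sense of the problem size quoted in the abstract, the following is a rough back-of-the-envelope sketch of the memory footprint of one multi-right-hand-side vector on the 32^3 × 64 lattice. It assumes 4 spinor components per site, which is standard for Dirac-type lattice operators but is not stated in the abstract; the function name `vector_footprint` and its parameters are illustrative only and do not come from the paper.

```python
# Rough size estimate for one multi-RHS vector in the abstract's setup.
# Assumption (not stated in the abstract): 4 spinor components per site.

def vector_footprint(nx, ny, nz, nt, n_color, n_rhs, bytes_per_real, n_spin=4):
    """Return (sites, complex_entries, total_bytes) for one multi-RHS vector."""
    sites = nx * ny * nz * nt
    # One complex number per (site, spin, color, right-hand side).
    complex_entries = sites * n_spin * n_color * n_rhs
    # A complex number is stored as two reals.
    total_bytes = complex_entries * 2 * bytes_per_real
    return sites, complex_entries, total_bytes

# Parameters from the abstract: 32^3 x 64 lattice, N_c = 3, N_v = 12.
sites, entries, sp_bytes = vector_footprint(
    32, 32, 32, 64, n_color=3, n_rhs=12, bytes_per_real=4)
_, _, dp_bytes = vector_footprint(
    32, 32, 32, 64, n_color=3, n_rhs=12, bytes_per_real=8)

print(f"lattice sites:          {sites}")
print(f"complex entries:        {entries}")
print(f"single precision bytes: {sp_bytes}")
print(f"double precision bytes: {dp_bytes}")
```

Under these assumptions a single vector holds about 3.0 × 10^8 complex entries, i.e. roughly 2.4 GB in single and 4.8 GB in double precision, before counting the gauge field or the output vector.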

Published in EPJ Web of Conferences

ISSN: 2100-014X (Online)
Publisher: EDP Sciences
Country of publisher: France
LCC subjects: Science: Physics
Website: http://www.epj-conferences.org/
