Journal of Hydroinformatics (Jul 2023)

GPU-parallelisation of Haar wavelet-based grid resolution adaptation for fast finite volume modelling: application to shallow water flows

  • Alovya Ahmed Chowdhury,
  • Georges Kesserwani,
  • Charles Rougé,
  • Paul Richmond

DOI
https://doi.org/10.2166/hydro.2023.154
Journal volume & issue
Vol. 25, no. 4
pp. 1210 – 1234

Abstract

Read online

Wavelet-based grid resolution adaptation driven by the ‘multiresolution analysis’ (MRA) of the Haar wavelet (HW) allows to devise an adaptive first-order finite volume (FV1) model (HWFV1) that can readily preserve the modelling fidelity of its reference uniform-grid FV1 counterpart. However, the MRA entails an enormous computational effort as it involves ‘encoding’ (coarsening), ‘decoding’ (refining), analysing and traversing modelled data across a deep hierarchy of nested, uniform grids. GPU-parallelisation of the MRA is needed to handle its computational effort, but its algorithmic structure (1) hinders coalesced memory access on the GPU and (2) involves an inherently sequential tree traversal problem. This work redesigns the algorithmic structure of the MRA in order to parallelise it on the GPU, addressing (1) by applying Z-order space-filling curves and (2) by adopting a parallel tree traversal algorithm. This results in a GPU-parallelised HWFV1 model (GPU-HWFV1). GPU-HWFV1 is verified against its CPU predecessor (CPU-HWFV1) and its GPU-parallelised reference uniform-grid counterpart (GPU-FV1) over five shallow water flow test cases. GPU-HWFV1 preserves the modelling fidelity of GPU-FV1 while being up to 30 times faster. Compared to CPU-HWFV1, it is up to 200 times faster, suggesting that the GPU-parallelised MRA could be used to speed up other FV1 models. HIGHLIGHTS Wavelet-based grid adaptation is parallelised on the GPU via a Z-order space-filling curve and a parallel tree traversal algorithm.; An adaptive Haar wavelet first-order finite volume shallow water model running on the GPU is developed (GPU-HWFV1).; GPU-HWFV1 is 20–300 times faster than its single-core serial CPU version 4.; GPU-HWFV1 is 1.3–30 times faster than its GPU-parallelised reference uniform-grid counterpart.;

Keywords