Symmetric Tridiagonal Eigenvalue Solver Across CPU Graphics Processing Unit (GPU) Nodes

Erika Hernández-Rubio; Alberto Estrella-Cruz; Amilcar Meneses-Viveros; Jorge Alberto Rivera-Rivera; Liliana Ibeth Barbosa-Santillán; Sergio Víctor Chapa-Vergara

doi:10.3390/app142210716

Applied Sciences (Nov 2024)

Affiliations

Erika Hernández-Rubio: Sección de Estudios de Posgrado e Investigación, Escuela Superior de Cómputo, Instituto Politécnico Nacional, Mexico City 07320, Mexico
Alberto Estrella-Cruz: Departamento de Computación, Cinvestav-IPN, Mexico City 07360, Mexico
Amilcar Meneses-Viveros: Departamento de Computación, Cinvestav-IPN, Mexico City 07360, Mexico
Jorge Alberto Rivera-Rivera: Escuela Superior de Cómputo, Instituto Politécnico Nacional, Mexico City 07320, Mexico
Liliana Ibeth Barbosa-Santillán: Departamento de Ciencias Computacionales, Instituto Tecnológico y de Estudios Superiores de Monterrey, Monterrey 45138, Mexico
Sergio Víctor Chapa-Vergara: Departamento de Computación, Cinvestav-IPN, Mexico City 07360, Mexico

DOI: https://doi.org/10.3390/app142210716
Journal volume & issue: Vol. 14, no. 22
p. 10716

Abstract

In this work, we present an improved and scalable implementation of Cuppen’s algorithm for diagonalizing symmetric tridiagonal matrices. The approach uses a hybrid-heterogeneous parallelization technique that takes advantage of both GPUs and CPUs in a distributed hardware architecture. Cuppen’s algorithm is both a theoretical concept and a powerful tool in scientific and engineering applications. It plays a key role in matrix diagonalization, with uses in Density Functional Theory (DFT) and spectral clustering. This efficient and numerically stable algorithm computes the eigenvalues and eigenvectors of symmetric tridiagonal matrices, making it a crucial component of many computational methods. One challenge in parallelizing algorithms for GPUs is their limited memory capacity. We overcome this limitation by using multiple nodes with both CPUs and GPUs: subproblems that fit within the memory of each device are solved in parallel, and their results are then combined to obtain the complete solution. The proposed hybrid-heterogeneous approach outperforms state-of-the-art libraries while maintaining a high degree of accuracy in terms of orthogonality and quality of the eigenvectors. The sequential version of the algorithm under our approach also shows superior performance and potential for practical use. In our experiments, the performance of the implementation scaled by 2× when using two graphics cards in the same node. Symmetric tridiagonal eigenvalue solvers are fundamental to solving more general eigenvalue problems, and the divide-and-conquer approach used in this implementation can be extended to singular value solvers. Given the wide range of eigenvalue problems encountered in scientific and engineering domains, this work helps advance computational methods for efficient and accurate matrix diagonalization.
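
The divide-and-conquer idea summarized in the abstract can be illustrated with a minimal single-process sketch. This is not the authors' implementation: it assumes NumPy, the function name `cuppen_eigh` and the `leaf` parameter are hypothetical, and the rank-one update at each merge is solved with a dense `numpy.linalg.eigh` call instead of the secular-equation solver and deflation that a production divide-and-conquer code would use. It also does no GPU offloading or multi-node distribution. What it does show is the tearing step (split the matrix into two smaller tridiagonal subproblems plus a rank-one correction) and the merge step.

```python
import numpy as np

def cuppen_eigh(d, e, leaf=4):
    """Eigenpairs of the symmetric tridiagonal matrix with diagonal d
    and off-diagonal e, via Cuppen-style tearing (illustrative only)."""
    d = np.asarray(d, dtype=float)
    e = np.asarray(e, dtype=float)
    n = d.size
    if n <= leaf:
        # Small subproblem: solve directly with a dense solver.
        T = np.diag(d) + np.diag(e, 1) + np.diag(e, -1)
        return np.linalg.eigh(T)

    # Tear: T = blockdiag(T1, T2) + beta * v v^T, where v has ones at
    # positions k-1 and k, and beta is the coupling off-diagonal entry.
    k = n // 2
    beta = e[k - 1]
    d1, d2 = d[:k].copy(), d[k:].copy()
    d1[-1] -= beta
    d2[0] -= beta

    # Conquer: the two halves are independent and could run in parallel.
    w1, q1 = cuppen_eigh(d1, e[:k - 1], leaf)
    w2, q2 = cuppen_eigh(d2, e[k:], leaf)

    # Merge: T = Q (D + beta * z z^T) Q^T with z = Q^T v.
    z = np.concatenate([q1[-1, :], q2[0, :]])
    D = np.diag(np.concatenate([w1, w2]))
    # Assumption: dense eigh stands in for the secular-equation solve.
    w, u = np.linalg.eigh(D + beta * np.outer(z, z))

    q = np.zeros((n, n))
    q[:k, :k] = q1
    q[k:, k:] = q2
    return w, q @ u


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n = 32
    d, e = rng.standard_normal(n), rng.standard_normal(n - 1)
    w, q = cuppen_eigh(d, e)
    T = np.diag(d) + np.diag(e, 1) + np.diag(e, -1)
    # Accuracy measures of the kind mentioned in the abstract:
    print("orthogonality error:", np.linalg.norm(q.T @ q - np.eye(n)))
    print("residual error:     ", np.linalg.norm(T @ q - q * w))
```

The two recursive calls do not depend on each other, which is the property that lets subproblems be assigned to separate devices and merged afterwards.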

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci
