Journal of Taibah University for Science (Dec 2024)
The Hessian by blocks for neural networks by backward propagation
Abstract
The back-propagation algorithm combined with stochastic gradient descent, together with the increase in computing power, underlies the recent deep learning trend. For some problems, however, gradient methods still converge very slowly. Newton's method offers potentially faster convergence: it uses the Hessian matrix to guide the optimization process, but at a higher computational cost per iteration. Indeed, although the expression of the Hessian matrix is explicitly known, previous work has not proposed an efficient algorithm for its fast computation. In this work, we first propose a backward algorithm that computes the exact Hessian matrix. In addition, we introduce original operators for the calculation of second derivatives, which improve readability and allow the backward algorithm to be parallelized. To study the practical performance of Newton's method, we apply the proposed algorithm to train two classical neural networks on regression and classification problems and report the associated numerical results.
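As a point of reference for the trade-off described above, the following minimal sketch shows a damped Newton step on a small least-squares problem, with the exact Hessian obtained by automatic differentiation. This is only an illustration of Newton's method with an exact Hessian, not the block-wise backward algorithm proposed in this paper; the model, data, and damping parameter are hypothetical.

```python
# Illustrative sketch only: Newton's method with an exact Hessian computed by
# automatic differentiation (PyTorch), NOT the paper's block-wise backward
# algorithm. The model and data below are made up for demonstration.
import torch
from torch.autograd.functional import jacobian, hessian

torch.manual_seed(0)
X = torch.randn(64, 3)                                   # 64 samples, 3 features
y = X @ torch.tensor([1.0, -2.0, 0.5]) + 0.1 * torch.randn(64)

def loss(w):
    # Mean squared error of a linear model parameterized by the flat vector w.
    return ((X @ w - y) ** 2).mean()

w = torch.zeros(3)
for _ in range(5):
    g = jacobian(loss, w)                                # gradient, shape (3,)
    H = hessian(loss, w)                                 # exact Hessian, shape (3, 3)
    # Damped Newton step: solve (H + lambda * I) d = -g for numerical stability.
    d = torch.linalg.solve(H + 1e-6 * torch.eye(3), -g)
    w = w + d
print(w)
```

Each iteration requires forming and factorizing the Hessian, which is what makes an efficient (here block-wise, backward) computation of the exact Hessian the central issue addressed in this work.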
Keywords