Enhancing deep neural network training efficiency and performance through linear prediction

Hejie Ying; Mengmeng Song; Yaohong Tang; Shungen Xiao; Zimin Xiao

doi:10.1038/s41598-024-65691-0

Scientific Reports (Jul 2024)

Enhancing deep neural network training efficiency and performance through linear prediction

Hejie Ying,
Mengmeng Song,
Yaohong Tang,
Shungen Xiao,
Zimin Xiao

Affiliations

Hejie Ying: Ningde Normal University
Mengmeng Song: Ningde Normal University
Yaohong Tang: Ningde Normal University
Shungen Xiao: Ningde Normal University
Zimin Xiao: Ningde Normal University

DOI: https://doi.org/10.1038/s41598-024-65691-0
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Deep neural networks have achieved remarkable success in various fields. However, training an effective deep neural network still poses challenges. This paper aims to propose a method to optimize the training effectiveness of deep neural networks, with the goal of improving their performance. Firstly, based on the observation that parameters (weights and bias) of deep neural network change in certain rules during training process, the potential of parameters prediction for improving training efficiency is discovered. Secondly, the potential of parameters prediction to improve the performance of deep neural network by noise injection introduced by prediction errors is revealed. And then, considering the limitations comprehensively, a deep neural network Parameters Linear Prediction method is exploit. Finally, performance and hyperparameter sensitivity validations are carried out on some representative backbones. Experimental results show that by employing proposed Parameters Linear Prediction method, as opposed to SGD, has led to an approximate 1% increase in accuracy for optimal model, along with a reduction of about 0.01 in top-1/top-5 error. Moreover, it also exhibits stable performance under various hyperparameter settings, shown the effectiveness of the proposed method and validated its capacity in enhancing network’s training efficiency and performance.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal

Abstract

Keywords