IEEE Access (Jan 2020)

Approximating the Gradient of Cross-Entropy Loss Function

  • Li Li
  • Milos Doroslovacki
  • Murray H. Loew

DOI: https://doi.org/10.1109/ACCESS.2020.3001531
Journal volume & issue: Vol. 8, pp. 111626–111635

Abstract

A loss function has two crucial roles in training a conventional discriminant deep neural network (DNN): (i) it measures the goodness of classification and (ii) it generates the gradients that drive the training of the network. In this paper, we approximate the gradient of the cross-entropy loss, which is the loss function most often used in classification DNNs. The proposed approximations are noise-free, meaning that they depend only on the labels of the training set. They have a fixed length, which avoids the vanishing-gradient problem of the cross-entropy loss. By skipping the forward pass, the computational complexity of the proposed approximations is reduced to O(n), where n is the batch size. Two claims are established from experiments in which DNNs are trained using the proposed approximations: (i) it is possible to train a discriminant network without explicitly defining a loss function, and (ii) the success of training does not imply convergence of the network parameters to fixed values. The experiments show that the proposed gradient approximations achieve classification accuracy comparable to that of conventional loss functions and can accelerate training on multiple datasets.
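For context, the exact gradient of the softmax cross-entropy loss with respect to the logits is p − y, where p is the softmax output and y is the one-hot label; an approximation that depends only on y and has a fixed norm therefore needs no forward pass. The Python sketch below contrasts the exact gradient with one such label-only, fixed-norm surrogate. The specific surrogate used here (the exact gradient evaluated at a uniform prediction, rescaled to unit length) is an illustrative assumption, not the paper's formula.

    import numpy as np

    def softmax(z):
        # Numerically stable softmax over the last axis.
        e = np.exp(z - z.max(axis=-1, keepdims=True))
        return e / e.sum(axis=-1, keepdims=True)

    def exact_ce_grad(logits, labels, num_classes):
        # Exact gradient of softmax cross-entropy w.r.t. the logits: p - y.
        y = np.eye(num_classes)[labels]
        return softmax(logits) - y

    def label_only_grad(labels, num_classes, length=1.0):
        # Hypothetical label-only surrogate: the exact gradient evaluated at
        # a uniform prediction p = 1/K, rescaled to a fixed norm. It uses only
        # the labels (no forward pass), so a batch of n samples costs O(n).
        # This specific form is an illustrative assumption, not the paper's.
        y = np.eye(num_classes)[labels]
        g = 1.0 / num_classes - y
        return length * g / np.linalg.norm(g, axis=-1, keepdims=True)

    rng = np.random.default_rng(0)
    logits = rng.normal(size=(4, 10))   # batch of 4 samples, 10 classes
    labels = np.array([3, 1, 7, 0])
    print(exact_ce_grad(logits, labels, 10).shape)               # (4, 10)
    print(np.linalg.norm(label_only_grad(labels, 10), axis=-1))  # all 1.0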

Keywords