Robust Learning with Implicit Residual Networks

Viktor Reshniak; Clayton G. Webster

doi:10.3390/make3010003

Machine Learning and Knowledge Extraction (Dec 2020)

Affiliations

Viktor Reshniak: Data Analysis and Machine Learning, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
Clayton G. Webster: Department of Mathematics, University of Tennessee at Knoxville, Knoxville, TN 37996, USA

DOI: https://doi.org/10.3390/make3010003
Journal volume & issue: Vol. 3, no. 1
pp. 34 – 55

Abstract

In this work, we propose a new deep architecture built from residual blocks inspired by implicit discretization schemes. In contrast to standard feed-forward networks, the outputs of the proposed implicit residual blocks are defined as the fixed points of appropriately chosen nonlinear transformations. We show that this choice improves the stability of both forward and backward propagation, has a favorable impact on generalization power, and allows the robustness of the network to be controlled with only a few hyperparameters. In addition, the proposed reformulation of ResNet does not introduce new parameters and can potentially reduce the number of required layers because of the improved forward stability. Finally, we derive a memory-efficient training algorithm, propose a stochastic regularization technique, and provide numerical results in support of our findings.
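The idea of a block whose output is a fixed point can be illustrated with a minimal scalar sketch. The abstract does not give the block's formula, so everything here is an assumption made for illustration: the theta-weighted update y = x + h((1 - theta) f(x) + theta f(y)), the residual function `residual`, the step size `h`, and the plain fixed-point solver are hypothetical choices, not the authors' implementation.

```python
import math


def residual(z, w=-2.0, b=0.1):
    """Hypothetical residual function f(z) = tanh(w*z + b) (assumed, not from the paper)."""
    return math.tanh(w * z + b)


def implicit_block(x, theta=0.5, h=0.4, tol=1e-12, max_iter=200):
    """Return y solving y = x + h*((1 - theta)*f(x) + theta*f(y)).

    theta = 0 recovers an ordinary (explicit) residual block y = x + h*f(x).
    For theta > 0 the output is defined implicitly and is found here by
    simple fixed-point iteration, which converges when h*theta*|f'| < 1.
    """
    explicit_part = x + h * (1.0 - theta) * residual(x)
    y = x  # initial guess: the block input
    for _ in range(max_iter):
        y_next = explicit_part + h * theta * residual(y)
        if abs(y_next - y) < tol:
            return y_next
        y = y_next
    raise RuntimeError("fixed-point iteration did not converge")


x = 1.0
y = implicit_block(x)
# Defect of the fixed-point equation at the returned output (should be near zero).
defect = y - (x + 0.4 * (0.5 * residual(x) + 0.5 * residual(y)))
```

In this sketch the block has the same parameters (`w`, `b`) as its explicit counterpart. Only the way the output is computed changes, which matches the abstract's remark that the reformulation introduces no new parameters.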

Published in Machine Learning and Knowledge Extraction

ISSN: 2504-4990 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware
Website: https://www.mdpi.com/journal/make
