Nature Communications (Nov 2019)
The Eighty Five Percent Rule for optimal learning
Abstract
Is there an optimum difficulty level for training? In this paper, the authors show that for the widely-used class of stochastic gradient-descent based learning algorithms, learning is fastest when the accuracy during training is 85%.