PHYLOGENETIC REPLAY LEARNING IN DEEP NEURAL NETWORKS

Jean-Patrice Glafkides; Gene I Sher; Herman Akdag

doi:10.5455/jjcit.71-1643583878

Jordanian Journal of Computers and Information Technology (Sep 2022)

PHYLOGENETIC REPLAY LEARNING IN DEEP NEURAL NETWORKS

Jean-Patrice Glafkides,
Gene I Sher,
Herman Akdag

Affiliations

Jean-Patrice Glafkides: PARAGRAPHE EA 349 - PARIS VIII University
Gene I Sher: 28 Bis Rue Legrand
Herman Akdag: PARAGRAPHE EA 349 - PARIS VIII University

DOI: https://doi.org/10.5455/jjcit.71-1643583878
Journal volume & issue: Vol. 8, no. 3
pp. 218 – 231

Abstract

Read online

Though substantial advancements have been made in training deep neural networks, one problem remains, the vanishing gradient. The very strength of deep neural networks, their depth, is also unfortunately their problem, due to the difficulty of thoroughly training the deeper layers due to the vanishing gradient. This paper proposes "Phylogenetic Replay Learning", a learning methodology that substantially alleviates the vanishing gradient problem. Unlike the residual learning methods, it does not restrict the structure of the model. Instead, it leverages elements from Neuroevolution, transfer learning, and layer by layer training. We demonstrate that this new approach is able to produce a better performing model, and by calculating Shannon Entropy of weights, we show that the deeper layers are trained much more thoroughly and contain statistically significantly more information than when a model is trained in a traditional brute force manner... [JJCIT 2022; 8(3.000): 218-231]

Published in Jordanian Journal of Computers and Information Technology

ISSN: 2413-9351 (Print); 2415-1076 (Online)
Publisher: Scientific Research Support Fund of Jordan (SRSF) and Princess Sumaya University for Technology (PSUT)
Country of publisher: Jordan
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://jjcit.org/

About the journal

Abstract

Keywords