PHYLOGENETIC REPLAY LEARNING IN DEEP NEURAL NETWORKS

被引:1
作者
Glafkides, Jean-Patrice [1 ]
Sher, Gene, I [2 ]
Akdag, Herman [1 ]
机构
[1] PARIS VIII Univ, PARAGRAPHE EA 349, Paris, France
[2] DataValoris, Laramie, WY USA
来源
JORDANIAN JOURNAL OF COMPUTERS AND INFORMATION TECHNOLOGY | 2022年 / 8卷 / 03期
关键词
Neural networks; Neuroevolution; Phylogenetic replay learning; Deep learning; Vanishing gradient;
D O I
10.5455/jjcit.71-1643583878
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Though substantial advancements have been made in training deep neural networks, one problem remains, the vanishing gradient. The very strength of deep neural networks, their depth, is also unfortunately their problem, due to the difficulty of thoroughly training the deeper layers due to the vanishing gradient. This paper proposes "Phylogenetic Replay Learning", a learning methodology that substantially alleviates the vanishing-gradient problem. Unlike the residual learning methods, it does not restrict the structure of the model. Instead, it leverages elements from neuroevolution, transfer learning and layer-by-layer training. We demonstrate that this new approach is able to produce a better performing model and by calculating Shannon entropy of weights, we show that the deeper layers are trained much more thoroughly and contain statistically significantly more information than when a model is trained in a traditional brute force manner.
引用
收藏
页码:218 / 231
页数:14
相关论文
共 29 条
[1]  
[Anonymous], 2013, P 30 INT C MACH LEAR
[2]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[3]  
Conti E, 2018, ADV NEUR IN, V31
[4]  
De Nardi R, 2006, IEEE C EVOL COMPUTAT, P1784
[5]  
Gaier A, 2019, Arxiv, DOI [arXiv:1906.04358, 10.48550/ARXIV.1906.04358]
[6]  
Glorot X., 2011, P 14 INT C ART INT S, P315, DOI DOI 10.1002/ECS2.1832
[7]  
Glorot Xavier, 2010, AISTATS
[8]  
Gomez F, 2008, J MACH LEARN RES, V9, P937
[9]  
Granmo OC, 2019, Arxiv, DOI [arXiv:1905.09688, 10.48550/arXiv.1905.09688]
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778