PHYLOGENETIC REPLAY LEARNING IN DEEP NEURAL NETWORKS

Cited by: 1
Authors
Glafkides, Jean-Patrice [1]
Sher, Gene I. [2]
Akdag, Herman [1]
Affiliations
[1] PARIS VIII Univ, PARAGRAPHE EA 349, Paris, France
[2] DataValoris, Laramie, WY USA
Source
JORDANIAN JOURNAL OF COMPUTERS AND INFORMATION TECHNOLOGY | 2022 / Vol. 8 / Issue 03
Keywords
Neural networks; Neuroevolution; Phylogenetic replay learning; Deep learning; Vanishing gradient
DOI
10.5455/jjcit.71-1643583878
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Although substantial advances have been made in training deep neural networks, one problem remains: the vanishing gradient. The very strength of deep neural networks, their depth, is also their weakness, because the vanishing gradient makes the deeper layers difficult to train thoroughly. This paper proposes "Phylogenetic Replay Learning", a learning methodology that substantially alleviates the vanishing-gradient problem. Unlike residual learning methods, it does not restrict the structure of the model. Instead, it leverages elements from neuroevolution, transfer learning and layer-by-layer training. We demonstrate that this approach produces a better-performing model and, by calculating the Shannon entropy of the weights, show that the deeper layers are trained much more thoroughly and contain statistically significantly more information than when a model is trained in a traditional brute-force manner.
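The abstract does not say how the Shannon entropy of the weights is estimated. As a rough illustration only, the sketch below assumes one common choice: bin a layer's weights into an equal-width histogram and compute H = -sum(p_i * log2(p_i)) over the occupied bins. The function name `shannon_entropy`, the `bins` parameter, and the sample weights are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def shannon_entropy(weights, bins=16):
    """Entropy in bits of an equal-width histogram of `weights`.

    Assumption: histogram binning is used here for illustration;
    the paper's actual estimator is not given in the abstract.
    """
    if not weights:
        raise ValueError("weights must be non-empty")
    lo, hi = min(weights), max(weights)
    if lo == hi:
        return 0.0  # every weight falls in one bin, so no uncertainty
    width = (hi - lo) / bins
    # Map each weight to a bin index; the maximum is clamped into the last bin.
    counts = Counter(min(int((w - lo) / width), bins - 1) for w in weights)
    total = len(weights)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# Hypothetical weights for two layers of some model.
shallow_layer = [-0.8, -0.3, 0.1, 0.4, 0.9, -0.6, 0.2, 0.7]
deep_layer = [0.01, 0.01, 0.02, 0.01, 0.01, 0.02, 0.01, 0.01]

print(shannon_entropy(shallow_layer, bins=4))
print(shannon_entropy(deep_layer, bins=4))
```

Under this assumed estimator, a layer whose weights stay clustered near their initial values tends to give a lower entropy than a layer whose weights have spread out during training, which is the kind of comparison the abstract describes between deeper layers under the two training regimes.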
Pages: 218-231
Number of pages: 14