PHYLOGENETIC REPLAY LEARNING IN DEEP NEURAL NETWORKS

Cited by: 1

Authors:

Glafkides, Jean-Patrice [1]

Sher, Gene I. [2]

Akdag, Herman [1]

Affiliations:

[1] PARIS VIII Univ, PARAGRAPHE EA 349, Paris, France

[2] DataValoris, Laramie, WY USA

Source:

JORDANIAN JOURNAL OF COMPUTERS AND INFORMATION TECHNOLOGY | 2022 / Vol. 8 / No. 03

Keywords:

Neural networks; Neuroevolution; Phylogenetic replay learning; Deep learning; Vanishing gradient

DOI:

10.5455/jjcit.71-1643583878

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Though substantial advances have been made in training deep neural networks, one problem remains: the vanishing gradient. The very strength of deep neural networks, their depth, is also their weakness, because the vanishing gradient makes the deeper layers difficult to train thoroughly. This paper proposes "Phylogenetic Replay Learning", a learning methodology that substantially alleviates the vanishing-gradient problem. Unlike residual learning methods, it does not restrict the structure of the model. Instead, it leverages elements from neuroevolution, transfer learning and layer-by-layer training. We demonstrate that this new approach produces a better-performing model and, by calculating the Shannon entropy of the weights, we show that the deeper layers are trained much more thoroughly and contain statistically significantly more information than when a model is trained in a traditional brute-force manner.
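
The abstract does not say how the Shannon entropy of the weights is estimated. As a rough illustration only, one common approach is to bin a layer's weights into a histogram and compute the entropy of the resulting empirical distribution. The function name `weight_entropy`, the histogram estimator and the default bin count below are assumptions made for this sketch, not details taken from the paper.

```python
import numpy as np

def weight_entropy(weights, bins=256):
    """Histogram-based Shannon entropy (in bits) of a set of weights.

    Assumption: entropy is estimated from an equal-width histogram;
    the paper's actual estimator is not given in the abstract.
    """
    w = np.asarray(weights, dtype=np.float64).ravel()
    counts, _ = np.histogram(w, bins=bins)
    # Keep only non-empty bins so that log2 is well defined.
    p = counts[counts > 0] / counts.sum()
    return float(-(p * np.log2(p)).sum())
```

Under this estimator, a layer whose weights are all identical scores 0 bits, while weights spread evenly across the bins approach log2(bins) bits. Comparing the per-layer values of two trained models would be one way to explore the kind of claim the abstract makes, with the caveat that the result depends on the chosen bin count.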


Pages: 218-231

Page count: 14
