Layer-Wise Weight Decay for Deep Neural Networks

Cited by: 3

Authors:

Ishii, Masato [1]

Sato, Atsushi [1]

Affiliations:

[1] NEC Data Sci Res Labs, Kawasaki, Kanagawa, Japan

Source:

IMAGE AND VIDEO TECHNOLOGY (PSIVT 2017) | 2018 / Vol. 10749

Keywords:

DOI:

10.1007/978-3-319-75786-5_23

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

In this paper, we propose layer-wise weight decay for efficient training of deep neural networks. Our method sets a different weight-decay coefficient for each layer so that the ratio between the scale of the back-propagated gradients and that of the weight decay is constant throughout the network. With this setting, we can avoid under- or over-fitting and train all layers properly without having to tune the coefficients layer by layer. Experimental results show that our method can enhance the performance of existing deep neural networks without any change to the network models.
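
The balancing idea stated in the abstract can be illustrated with a minimal sketch. This is not the authors' formula, which this record does not give. It assumes that "scale" means an L2 norm, that the per-layer gradient and weight norms are measured directly, and that the quantity held constant is (decay coefficient × weight norm) / gradient norm. The names `l2_norm`, `layerwise_decay`, and `target_ratio` are hypothetical.

```python
import math


def l2_norm(values):
    """Euclidean norm of a flat list of numbers."""
    return math.sqrt(sum(v * v for v in values))


def layerwise_decay(weights, grads, target_ratio):
    """Return one decay coefficient per layer.

    Assumption (not taken from the paper): the coefficient for each layer
    is chosen so that (coeff * ||w||) / ||g|| equals target_ratio.
    """
    coeffs = []
    for w, g in zip(weights, grads):
        w_norm = l2_norm(w)
        if w_norm == 0.0:
            raise ValueError("layer has zero weight norm")
        coeffs.append(target_ratio * l2_norm(g) / w_norm)
    return coeffs


# Toy two-layer example with made-up weights and gradients.
weights = [[0.5, -0.5], [2.0, 0.0, 0.0]]
grads = [[0.03, 0.04], [0.0, 0.3, 0.4]]

coeffs = layerwise_decay(weights, grads, target_ratio=0.1)

# The decay-to-gradient ratio is now the same in every layer.
ratios = [c * l2_norm(w) / l2_norm(g) for c, w, g in zip(coeffs, weights, grads)]
```

In this toy setup the second layer has gradients ten times larger than the first, so it receives a proportionally different coefficient, whereas a single global coefficient would leave the two ratios unequal.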

Pages: 276-289

Number of pages: 14
