Subaging in underparametrized deep neural networks

被引:1
作者
Herrera Segura, Carolina [1 ]
Montoya, Edison [2 ,4 ]
Tapias, Diego [3 ]
机构
[1] Univ Antioquia, Inst Fis, Medellin, Colombia
[2] BCFort, Medellin, Colombia
[3] Univ Gottingen, Inst Theoret Phys, Gottingen, Germany
[4] Univ Antioquia, Medellin, Colombia
来源
MACHINE LEARNING-SCIENCE AND TECHNOLOGY | 2022年 / 3卷 / 03期
关键词
subaging; deep neural networks; glassy dynamics; underparametrized; STATISTICAL-MECHANICS; REGIMES; ENERGY;
D O I
10.1088/2632-2153/ac8f1b
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider a simple classification problem to show that the dynamics of finite-width Deep Neural Networks in the underparametrized regime gives rise to effects similar to those associated with glassy systems, namely a slow evolution of the loss function and aging. Remarkably, the aging is sublinear in the waiting time (subaging) and the power-law exponent characterizing it is robust to different architectures under the constraint of a constant total number of parameters. Our results are maintained in the more complex scenario of the MNIST database. We find that for this database there is a unique exponent ruling the subaging behavior in the whole phase.
引用
收藏
页数:10
相关论文
共 50 条
[31]   The Representation of Speech in Deep Neural Networks [J].
Scharenborg, Odette ;
van der Gouw, Nikki ;
Larson, Martha ;
Marchiori, Elena .
MULTIMEDIA MODELING, MMM 2019, PT II, 2019, 11296 :194-205
[32]   Temporal Alignment for Deep Neural Networks [J].
Lin, Payton ;
Lyu, Dau-Cheng ;
Chang, Yun-Fan ;
Tsao, Yu .
2015 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2015, :108-112
[33]   Embedding Watermarks into Deep Neural Networks [J].
Uchida, Yusuke ;
Nagai, Yuki ;
Sakazawa, Shigeyuki ;
Satoh, Shin'ichi .
PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, :274-282
[34]   Digital watermarking for deep neural networks [J].
Nagai, Yuki ;
Uchida, Yusuke ;
Sakazawa, Shigeyuki ;
Satoh, Shin'ichi .
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2018, 7 (01) :3-16
[35]   Weight normalized deep neural networks [J].
Xu, Yixi ;
Wang, Xiao .
STAT, 2021, 10 (01)
[36]   TRACK FINDING WITH DEEP NEURAL NETWORKS [J].
Kucharczyk, Marcin ;
Wolter, Marcin .
COMPUTER SCIENCE-AGH, 2019, 20 (04) :477-493
[37]   Stable architectures for deep neural networks [J].
Haber, Eldad ;
Ruthotto, Lars .
INVERSE PROBLEMS, 2018, 34 (01)
[38]   Polymorphic Accelerators for Deep Neural Networks [J].
Azizimazreah, Arash ;
Chen, Lizhong .
IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (03) :534-546
[39]   Design Index for Deep Neural Networks [J].
Date, Prasanna ;
Hendler, James A. ;
Carothers, Christopher D. .
7TH ANNUAL INTERNATIONAL CONFERENCE ON BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES, (BICA 2016), 2016, 88 :131-138
[40]   Deep Neural Networks on Chip - A Survey [J].
Huo Yingge ;
Ali, Imran ;
Lee, Kang-Yoon .
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, :589-592