Curvature-corrected learning dynamics in deep neural networks

被引:0
|
作者
Huh, Dongsung [1 ]
机构
[1] MIT IBM Watson AI Lab, Cambridge, MA 02142 USA
来源
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119 | 2020年 / 119卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks exhibit complex learning dynamics due to their non-convex loss landscapes. Second-order optimization methods facilitate learning dynamics by compensating for ill-conditioned curvature. In this work, we investigate how curvature correction modifies the learning dynamics in deep linear neural networks and provide analytical solutions. We derive a generalized conservation law that preserves the path of parameter dynamics from curvature correction, which shows that curvature correction only modifies the temporal profiles of dynamics along the path. We show that while curvature correction accelerates the convergence dynamics of the input-output map, it can also negatively affect the generalization performance. Our analysis also reveals an undesirable effect of curvature correction that compromises stability of parameters dynamics during learning, especially with block-diagonal approximation of natural gradient descent. We introduce fractional curvature correction that resolves this problem while retaining most of the acceleration benefits of full curvature correction.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] CURVATURE-CORRECTED THERMOMETER
    不详
    ELECTRONICS WORLD & WIRELESS WORLD, 1991, 97 (1668): : 950 - 950
  • [2] A NEW CURVATURE-CORRECTED BANDGAP REFERENCE
    MEIJER, GCM
    SCHMALE, PC
    VANZALINGE, K
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1982, 17 (06) : 1139 - 1143
  • [3] FGMOS optimal curvature-corrected voltage reference
    Popa, Cosmin
    Circuits and Systems for Signal Processing , Information and Communication Technologies, and Power Sources and Systems, Vol 1 and 2, Proceedings, 2006, : 43 - 46
  • [4] A novel piecewise curvature-corrected CMOS bandgap reference
    Li Jing-hu
    Wang Yong-sheng
    Yu Ming-yan
    Ye Yi-zheng
    2008 7TH INTERNATIONAL CARIBBEAN CONFERENCE ON DEVICES, CIRCUITS AND SYSTEMS, 2008, : 1 - 5
  • [5] An improvement of a piecewise curvature-corrected CMOS bandgap reference
    Zawawi, Ruhaifi Abdullah
    Sidek, Othman
    Hassin, Wan Mohd Hafizi Wan
    Zulkipli, Mohamad Izat Amir
    Rhaffor, Nuha
    IEICE ELECTRONICS EXPRESS, 2011, 8 (22): : 1876 - 1881
  • [6] A Curvature-corrected Cartesian Grid Method and Its Application
    Sang, Wei-min
    Cai, Yang
    2015 INTERNATIONAL CONFERENCE ON MATERIALS AND ENGINEERING AND INDUSTRIAL APPLICATIONS (MEIA 2015), 2015, : 291 - 296
  • [7] Superior-Order Curvature-Corrected Logarithmic CMOS Nanostructure
    Popa, Cosmin
    ICQNM 2009: THIRD INTERNATIONAL CONFERENCE ON QUANTUM, NANO AND MICRO TECHNOLOGIES: PROCEEDINGS, 2009, : 130 - 133
  • [8] A new curvature-corrected CMOS bandgap voltage reference
    Zawawi, Ruhaifi Abdullah
    Sidek, Othman
    IEICE ELECTRONICS EXPRESS, 2012, 9 (04): : 240 - 244
  • [9] CURVATURE-CORRECTED VOLTAGE REFERENCE USING FGMOS DEVICES
    Popa, Cosmin
    EUROCON 2009: INTERNATIONAL IEEE CONFERENCE DEVOTED TO THE 150 ANNIVERSARY OF ALEXANDER S. POPOV, VOLS 1- 4, PROCEEDINGS, 2009, : 252 - 255
  • [10] A CURVATURE-CORRECTED LOW-VOLTAGE BANDGAP REFERENCE
    GUNAWAN, M
    MEIJER, GCM
    FONDERIE, J
    HUIJSING, JH
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1993, 28 (06) : 667 - 670