Data-driven effective model shows a liquid-like deep learning

被引:3
|
作者
Zou, Wenxuan [1 ]
Huang, Haiping [1 ]
机构
[1] Sun Yat Sen Univ, Sch Phys, PMI Lab, Guangzhou 510275, Peoples R China
来源
PHYSICAL REVIEW RESEARCH | 2021年 / 3卷 / 03期
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
LANDSCAPE; NETWORKS;
D O I
10.1103/PhysRevResearch.3.033290
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
The geometric structure of an optimization landscape is argued to be fundamentally important to support the success of deep neural network learning. A direct computation of the landscape beyond two layers is hard. Therefore, to capture the global view of the landscape, an interpretable model of the network-parameter (or weight) space must be established. However, the model is lacking so far. Furthermore, it remains unknown what the landscape looks like for deep networks of binary synapses, which plays a key role in robust and energy efficient neuromorphic computation. Here, we propose a statistical mechanics framework by directly building a least structured model of the high-dimensional weight space, considering realistic structured data, stochastic gradient descent training, and the computational depth of neural networks. We also consider whether the number of network parameters outnumbers the number of supplied training data, namely, over- or under-parametrization. Our least structured model reveals that the weight spaces of the under-parametrization and over-parameterization cases belong to the same class, in the sense that these weight spaces are well connected without any hierarchical clustering structure. In contrast, the shallow-network has a broken weight space, characterized by a discontinuous phase transition, thereby clarifying the benefit of depth in deep learning from the angle of high-dimensional geometry. Our effective model also reveals that inside a deep network, there exists a liquid-like central part of the architecture in the sense that the weights in this part behave as randomly as possible, providing algorithmic implications. Our data-driven model thus provides a statistical mechanics insight about why deep learning is unreasonably effective in terms of the high-dimensional weight space, and how deep networks are different from shallow ones.
引用
收藏
页数:19
相关论文
共 50 条
  • [11] Data-driven fault detection and isolation of nonlinear systems using deep learning for Koopman operator
    Bakhtiaridoust, Mohammadhosein
    Yadegar, Meysam
    Meskin, Nader
    ISA TRANSACTIONS, 2023, 134 : 200 - 211
  • [12] CRISPRnet: deep learning and data-driven CRISPR design for network-based multiplexed targeting
    Roncelli, Stefano
    Corsi, Giulia
    Anthon, Christian
    Luo, Yonglun
    Gorodkin, Jan
    Jensen, Lars Juhl
    FEBS OPEN BIO, 2024, 14 : 23 - 23
  • [13] Reproducing scientists' mobility: a data-driven model
    Vaccario, Giacomo
    Verginer, Luca
    Schweitzer, Frank
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [14] Data-Driven Compression and Efficient Learning of the Choquet Integral
    Islam, Muhammad Aminul
    Anderson, Derek T.
    Pinar, Anthony J.
    Havens, Timothy C.
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2018, 26 (04) : 1908 - 1922
  • [15] Deep learning architectures for data-driven damage detection in nonlinear dynamic systems under random vibrations
    Joseph, Harrish
    Quaranta, Giuseppe
    Carboni, Biagio
    Lacarbonara, Walter
    NONLINEAR DYNAMICS, 2024, 112 (23) : 20611 - 20636
  • [16] A data-driven computational model on the effects of immigration policies
    Simon, Miranda
    Schwartz, Cassilde
    Hudson, David
    Johnson, Shane D.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2018, 115 (34) : E7914 - E7923
  • [17] Occupational mobility and automation: a data-driven network model
    del Rio-Chanona, R. Maria
    Mealy, Penny
    Beguerisse-Diaz, Mariano
    Lafond, Francois
    Farmer, J. Doyne
    JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2021, 18 (174)
  • [18] Data-driven turbulence model for unsteady cavitating flow
    Zhang, Zhen
    Wang, Jingzhu
    Huang, Renfang
    Qiu, Rundi
    Chu, Xuesen
    Ye, Shuran
    Wang, Yiwei
    Liu, Qingkuan
    PHYSICS OF FLUIDS, 2023, 35 (01)
  • [19] Learning to Sample: Data-Driven Sampling and Reconstruction of FRI Signals
    Mulleti, Satish
    Zhang, Haiyang
    Eldar, Yonina C. C.
    IEEE ACCESS, 2023, 11 : 71048 - 71062
  • [20] A Data-Driven Iterative Learning Algorithm for Robot Kinematics Approximation
    Huu-Thiet Nguyen
    Cheah, Chien Chern
    Toh, Kar-Ann
    2019 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2019, : 1031 - 1036