Representations and generalization in artificial and brain neural networks

Cited by: 5
Authors
Li, Qianyi [1, 2]
Sorscher, Ben [3]
Sompolinsky, Haim [2, 4]
Affiliations
[1] Harvard Univ, Harvard Biophys Grad Program, Cambridge, MA 02138 USA
[2] Harvard Univ, Ctr Brain Sci, Cambridge, MA 02138 USA
[3] Stanford Univ, Appl Phys Dept, Stanford, CA 94305 USA
[4] Hebrew Univ Jerusalem, Edmond & Lily Safra Ctr Brain Sci, IL-9190401 Jerusalem, Israel
Keywords
deep neural networks; visual cortex; neural manifolds; few-shot learning; representational drift; geometry
DOI
10.1073/pnas.2311805121
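Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09
Abstract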
Humans and animals excel at generalizing from limited data, a capability yet to be fully replicated in artificial intelligence. This perspective investigates generalization in biological and artificial deep neural networks (DNNs), in both in-distribution and out-of-distribution contexts. We introduce two hypotheses: First, the geometric properties of the neural manifolds associated with discrete cognitive entities, such as objects, words, and concepts, are powerful order parameters. They link the neural substrate to the generalization capabilities and provide a unified methodology bridging gaps between neuroscience, machine learning, and cognitive science. We overview recent progress in studying the geometry of neural manifolds, particularly in visual object recognition, and discuss theories connecting manifold dimension and radius to generalization capacity. Second, we suggest that the theory of learning in wide DNNs, especially in the thermodynamic limit, provides mechanistic insights into the learning processes generating desired neural representational geometries and generalization. This includes the role of weight norm regularization, network architecture, and hyper-parameters. We will explore recent advances in this theory and ongoing challenges. We also discuss the dynamics of learning and its relevance to the issue of representational drift in the brain.
Pages: 10
Related papers
50 in total
  • [1] Compositional generalization through abstract representations in human and artificial neural networks
    Ito, Takuya
    Klinger, Tim
    Schultz, Douglas H.
    Murray, John D.
    Cole, Michael W.
    Rigotti, Mattia
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
  • [2] On the similarities of representations in artificial and brain neural networks for speech recognition
    Wingfield, Cai
    Zhang, Chao
    Devereux, Barry
    Fonteneau, Elisabeth
    Thwaites, Andrew
    Liu, Xunying
    Woodland, Phil
    Marslen-Wilson, William
    Su, Li
    FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2022, 16
  • [3] GENERALIZATION AND SPECIALIZATION IN ARTIFICIAL NEURAL NETWORKS
    HAMPSON, S
    PROGRESS IN NEUROBIOLOGY, 1991, 37 (05) : 383 - 431
  • [4] Redundant representations help generalization in wide neural networks
    Doimo, Diego
    Glielmo, Aldo
    Goldt, Sebastian
    Laio, Alessandro
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
  • [5] Redundant representations help generalization in wide neural networks
    Doimo, Diego
    Glielmo, Aldo
    Goldt, Sebastian
    Laio, Alessandro
    JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2023, 2023 (11)
  • [6] Learning flat representations with artificial neural networks
    Vlad Constantinescu
    Costin Chiru
    Tudor Boloni
    Adina Florea
    Robi Tacutu
    Applied Intelligence, 2021, 51 : 2456 - 2470
  • [7] Learning flat representations with artificial neural networks
    Constantinescu, Vlad
    Chiru, Costin
    Boloni, Tudor
    Florea, Adina
    Tacutu, Robi
    APPLIED INTELLIGENCE, 2021, 51 (04) : 2456 - 2470
  • [8] Linguistic generalization and compositionality in modern artificial neural networks
    Baroni, Marco
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2020, 375 (1791)
  • [9] Investigating generalization in parallel evolutionary artificial neural networks
    Davoian, Kristina
    Lippe, Wolfram-M.
    PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND APPLICATIONS, 2007, : 90 - +
  • [10] Statistical physics and representations in real and artificial neural networks
    Cocco, S.
    Monasson, R.
    Posani, L.
    Rosay, S.
    Tubiana, J.
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2018, 504 : 45 - 76