Representations and generalization in artificial and brain neural networks

被引:5
|
作者
Li, Qianyi [1 ,2 ]
Sorscher, Ben [3 ]
Sompolinsky, Haim [2 ,4 ]
机构
[1] Harvard Univ, Harvard Biophys Grad Program, Cambridge, MA 02138 USA
[2] Harvard Univ, Ctr Brain Sci, Cambridge, MA 02138 USA
[3] Stanford Univ, Appl Phys Dept, Stanford, CA 94305 USA
[4] Hebrew Univ Jerusalem, Edmond & Lily Safra Ctr Brain Sci, IL-9190401 Jerusalem, Israel
关键词
deep neural networks; visual cortex; neural manifolds; few-shot learning; representational drift; GEOMETRY;
D O I
10.1073/pnas.2311805121
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Humans and animals excel at generalizing from limited data, a capability yet to be fully replicated in artificial intelligence. This perspective investigates generalization in biological and artificial deep neural networks (DNNs), in both in-distribution and out-of-distribution contexts. We introduce two hypotheses: First, the geometric properties of the neural manifolds associated with discrete cognitive entities, such as objects, words, and concepts, are powerful order parameters. They link the neural substrate to the generalization capabilities and provide a unified methodology bridging gaps between neuroscience, machine learning, and cognitive science. We overview recent progress in studying the geometry of neural manifolds, particularly in visual object recognition, and discuss theories connecting manifold dimension and radius to generalization capacity. Second, we suggest that the theory of learning in wide DNNs, especially in the thermodynamic limit, provides mechanistic insights into the learning processes generating desired neural representational geometries and generalization. This includes the role of weight norm regularization, network architecture, and hyper-parameters. We will explore recent advances in this theory and ongoing challenges. We also discuss the dynamics of learning and its relevance to the issue of representational drift in the brain.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] The Heuristic Work of the Brain and Artificial Neural Networks
    Eskov V.M.
    Pyatin V.F.
    Eskov V.V.
    Ilyashenko L.K.
    Biophysics, 2019, 64 (2) : 293 - 299
  • [22] Improving generalization of artificial neural networks in rainfall-runoff modelling
    Giustolisi, O
    Laucelli, D
    HYDROLOGICAL SCIENCES JOURNAL, 2005, 50 (03) : 439 - 457
  • [23] Identification versus generalization comment on the criticism of indeterminacy of artificial neural networks
    Holena, Martin
    APPLIED CATALYSIS A-GENERAL, 2008, 334 (1-2) : 381 - 385
  • [24] NEWRON: A New Generalization of the Artificial Neuron to Enhance the Interpretability of Neural Networks
    Siciliano, Federico
    Bucarelli, Maria Sofia
    Tolomei, Gabriele
    Silvestri, Fabrizio
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [25] Understanding Robustness and Generalization of Artificial Neural Networks Through Fourier Masks
    Karantzas, Nikos
    Besier, Emma
    Caro, Josue Ortega
    Pitkow, Xaq
    Tolias, Andreas S.
    Patel, Ankit B.
    Anselmi, Fabio
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
  • [26] GENERALIZATION BY NEURAL NETWORKS
    SHEKHAR, S
    AMIN, MB
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1992, 4 (02) : 177 - 185
  • [27] On generalization by neural networks
    Kak, SC
    INFORMATION SCIENCES, 1998, 111 (1-4) : 293 - 302
  • [28] Relating Simple Sentence Representations in Deep Neural Networks and the Brain
    Jat, Sharmistha
    Tang, Hao
    Talukdar, Partha
    Mitchell, Tom
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5137 - 5154
  • [29] Artificial Neural Networks Using Quiver Representations of Finite Cyclic Groups
    Wanditra, Lucky Cahya
    Muchtadi-Alamsyah, Intan
    Nasution, Dellavitha
    SYMMETRY-BASEL, 2023, 15 (12):
  • [30] Shared spatiotemporal category representations in biological and artificial deep neural networks
    Greene, Michelle R.
    Hansen, Bruce C.
    PLOS COMPUTATIONAL BIOLOGY, 2018, 14 (07)