Representations and generalization in artificial and brain neural networks

被引:5
|
作者
Li, Qianyi [1 ,2 ]
Sorscher, Ben [3 ]
Sompolinsky, Haim [2 ,4 ]
机构
[1] Harvard Univ, Harvard Biophys Grad Program, Cambridge, MA 02138 USA
[2] Harvard Univ, Ctr Brain Sci, Cambridge, MA 02138 USA
[3] Stanford Univ, Appl Phys Dept, Stanford, CA 94305 USA
[4] Hebrew Univ Jerusalem, Edmond & Lily Safra Ctr Brain Sci, IL-9190401 Jerusalem, Israel
关键词
deep neural networks; visual cortex; neural manifolds; few-shot learning; representational drift; GEOMETRY;
D O I
10.1073/pnas.2311805121
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Humans and animals excel at generalizing from limited data, a capability yet to be fully replicated in artificial intelligence. This perspective investigates generalization in biological and artificial deep neural networks (DNNs), in both in-distribution and out-of-distribution contexts. We introduce two hypotheses: First, the geometric properties of the neural manifolds associated with discrete cognitive entities, such as objects, words, and concepts, are powerful order parameters. They link the neural substrate to the generalization capabilities and provide a unified methodology bridging gaps between neuroscience, machine learning, and cognitive science. We overview recent progress in studying the geometry of neural manifolds, particularly in visual object recognition, and discuss theories connecting manifold dimension and radius to generalization capacity. Second, we suggest that the theory of learning in wide DNNs, especially in the thermodynamic limit, provides mechanistic insights into the learning processes generating desired neural representational geometries and generalization. This includes the role of weight norm regularization, network architecture, and hyper-parameters. We will explore recent advances in this theory and ongoing challenges. We also discuss the dynamics of learning and its relevance to the issue of representational drift in the brain.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] View-symmetric representations of faces in human and artificial neural networks
    Zhu, Xun
    Watson, David M.
    Rogers, Daniel
    Andrews, Timothy J.
    NEUROPSYCHOLOGIA, 2025, 207
  • [32] How can artificial neural networks approximate the brain?
    Shao, Feng
    Shen, Zheng
    FRONTIERS IN PSYCHOLOGY, 2023, 13
  • [33] Classification of brain tumours using artificial neural networks
    Rao, B. V. Subba
    Kondaveti, Raja
    Prasad, R. V. V. S. V.
    Shanmukha, V.
    Sastry, K. B. S.
    Dasaradharam, Bh.
    ACTA IMEKO, 2022, 11 (01):
  • [34] Estimation of brain connectivity through Artificial Neural Networks
    Antonacci, Yuri
    Toppi, Jlenia
    Mattia, Donatella
    Pietrabissa, Antonio
    Astolfi, Laura
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 636 - 639
  • [35] Generalization theory and generalization methods for neural networks
    Wei, Hai-Kun
    Xu, Si-Xin
    Song, Wen-Zhong
    Zidonghua Xuebao/Acta Automatica Sinica, 2001, 27 (06): : 806 - 815
  • [36] On the generalization capability of artificial neural networks used to estimate fretting fatigue life
    Oliveira, Giorgio Andre Brito
    Cardoso, Raphael Araujo
    Freire Junior, Raimundo Carlos Silverio
    Doca, Thiago
    Araujo, Jose Alexander
    TRIBOLOGY INTERNATIONAL, 2024, 192
  • [37] Probing neural representations of scene perception in a hippocampally dependent task using artificial neural networks
    Frey, Markus
    Doeller, Christian F.
    Barry, Caswell
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2113 - 2121
  • [38] Dissociable Neural Representations of Adversarially Perturbed Images in Convolutional Neural Networks and the Human Brain
    Zhang, Chi
    Duan, Xiao-Han
    Wang, Lin-Yuan
    Li, Yong-Li
    Yan, Bin
    Hu, Guo-En
    Zhang, Ru-Yuan
    Tong, Li
    FRONTIERS IN NEUROINFORMATICS, 2021, 15
  • [39] WEIGHT SPACE STRUCTURE AND INTERNAL REPRESENTATIONS - A DIRECT APPROACH TO LEARNING AND GENERALIZATION IN MULTILAYER NEURAL NETWORKS
    MONASSON, R
    ZECCHINA, R
    PHYSICAL REVIEW LETTERS, 1995, 75 (12) : 2432 - 2435
  • [40] Novel approach for explaining the behavior of trained artificial neural networks with distributed representations
    Zhou, Yuan-hui
    Lu, Yu-chang
    Shi, Chun-yi
    Chinese Journal of Advanced Software Research, 1999, 6 (01): : 49 - 60