Unsupervised neural network models of the ventral visual stream

被引:171
|
作者
Zhuang, Chengxu [1 ]
Yan, Siming [2 ]
Nayebi, Aran [3 ]
Schrimpf, Martin [4 ]
Frank, Michael C. [1 ]
DiCarlo, James J. [4 ]
Yamins, Daniel L. K. [1 ,5 ,6 ]
机构
[1] Stanford Univ, Dept Psychol, Stanford, CA 94305 USA
[2] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
[3] Stanford Univ, Neurosci PhD Program, Stanford, CA 94305 USA
[4] MIT, Brain & Cognit Sci, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[5] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[6] Stanford Univ, Wu Tsai Neurosci Inst, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
ventral visual stream; deep neural networks; unsupervised algorithms; RECEPTIVE-FIELDS; AREA V4; RECOGNITION; INFANTS; INFORMATION; SELECTIVITY; FRAMEWORK; RESPONSES; FEATURES; PATHWAY;
D O I
10.1073/pnas.2014196118
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Deep neural networks currently provide the best quantitative models of the response patterns of neurons throughout the primate ventral visual stream. However, such networks have remained implausible as a model of the development of the ventral stream, in part because they are trained with supervised methods requiring many more labels than are accessible to infants during development. Here, we report that recent rapid progress in unsupervised learning has largely closed this gap. We find that neural network models learned with deep unsupervised contrastive embedding methods achieve neural prediction accuracy in multiple ventral visual cortical areas that equals or exceeds that of models derived using today's best supervised methods and that the mapping of these neural network models' hidden layers is neuroanatomically consistent across the ventral stream. Strikingly, we find that these methods produce brainlike representations even when trained solely with real human child developmental data collected from head-mounted cameras, despite the fact that these datasets are noisy and limited. We also find that semisupervised deep contrastive embeddings can leverage small numbers of labeled examples to produce representations with substantially improved error-pattern consistency to human behavior. Taken together, these results illustrate a use of unsupervised learning to provide a quantitative model of a multiarea cortical brain system and present a strong candidate for a biologically plausible computational theory of primate sensory learning.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Modelling attention control using a convolutional neural network designed after the ventral visual pathway
    Yu, Chen-Ping
    Liu, Huidong
    Samaras, Dimitrios
    Zelinsky, Gregory J.
    VISUAL COGNITION, 2019, 27 (5-8) : 416 - 434
  • [22] Face-Specific Activity in the Ventral Stream Visual Cortex Linked to Conscious Face Perception
    Li, Wenlu
    Cao, Dan
    Li, Jin
    Jiang, Tianzi
    NEUROSCIENCE BULLETIN, 2024, 40 (10) : 1434 - 1444
  • [23] Investigating Category- and Shape-Selective Neural Processing in Ventral and Dorsal Visual Stream Under Interocular Suppression
    Ludwig, Karin
    Kathmann, Norbert
    Sterzer, Philipp
    Hesselmann, Guido
    HUMAN BRAIN MAPPING, 2015, 36 (01) : 137 - 149
  • [24] Evidence for rapid localist plasticity in the ventral visual stream: the example of words
    Riesenhuber, Maximilian
    Glezer, Laurie S.
    LANGUAGE COGNITION AND NEUROSCIENCE, 2017, 32 (03) : 286 - 294
  • [25] A cross-validated cytoarchitectonic atlas of the human ventral visual stream
    Rosenke, Mona
    Weiner, Kevin S.
    Barnett, Michael A.
    Zilles, Karl
    Amunts, Katrin
    Goebel, Rainer
    Grill-Spector, Kalanit
    NEUROIMAGE, 2018, 170 : 257 - 270
  • [26] Robust handwritten character recognition with features inspired by visual ventral stream
    Borji, Ali
    Hamidi, Mandana
    Mahmoudi, Fariborz
    NEURAL PROCESSING LETTERS, 2008, 28 (02) : 97 - 111
  • [27] Computational modeling of the neural representation of object shape in the primate ventral visual system
    Eguchi, Akihiro
    Mender, Bedeho M. W.
    Evans, Benjamin D.
    Humphreys, Glyn W.
    Stringer, Simon M.
    FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2015, 9
  • [28] Mid-level visual features underlie the high-level categorical organization of the ventral stream
    Long, Bria
    Yu, Chen-Ping
    Konkle, Talia
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2018, 115 (38) : E9015 - E9024
  • [29] Do object-category selective regions in the ventral visual stream represent perceived distance information?
    Amit, Elinor
    Mehoudar, Eyal
    Trope, Yaacov
    Yovel, Galit
    BRAIN AND COGNITION, 2012, 80 (02) : 201 - 213
  • [30] Transformation of Visual Representations Across Ventral Stream Body-selective Patches
    Kumar, Satwant
    Popivanov, Ivo D.
    Vogels, Rufin
    CEREBRAL CORTEX, 2019, 29 (01) : 215 - 229