Unsupervised neural network models of the ventral visual stream

被引:171
|
作者
Zhuang, Chengxu [1 ]
Yan, Siming [2 ]
Nayebi, Aran [3 ]
Schrimpf, Martin [4 ]
Frank, Michael C. [1 ]
DiCarlo, James J. [4 ]
Yamins, Daniel L. K. [1 ,5 ,6 ]
机构
[1] Stanford Univ, Dept Psychol, Stanford, CA 94305 USA
[2] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
[3] Stanford Univ, Neurosci PhD Program, Stanford, CA 94305 USA
[4] MIT, Brain & Cognit Sci, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[5] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[6] Stanford Univ, Wu Tsai Neurosci Inst, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
ventral visual stream; deep neural networks; unsupervised algorithms; RECEPTIVE-FIELDS; AREA V4; RECOGNITION; INFANTS; INFORMATION; SELECTIVITY; FRAMEWORK; RESPONSES; FEATURES; PATHWAY;
D O I
10.1073/pnas.2014196118
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Deep neural networks currently provide the best quantitative models of the response patterns of neurons throughout the primate ventral visual stream. However, such networks have remained implausible as a model of the development of the ventral stream, in part because they are trained with supervised methods requiring many more labels than are accessible to infants during development. Here, we report that recent rapid progress in unsupervised learning has largely closed this gap. We find that neural network models learned with deep unsupervised contrastive embedding methods achieve neural prediction accuracy in multiple ventral visual cortical areas that equals or exceeds that of models derived using today's best supervised methods and that the mapping of these neural network models' hidden layers is neuroanatomically consistent across the ventral stream. Strikingly, we find that these methods produce brainlike representations even when trained solely with real human child developmental data collected from head-mounted cameras, despite the fact that these datasets are noisy and limited. We also find that semisupervised deep contrastive embeddings can leverage small numbers of labeled examples to produce representations with substantially improved error-pattern consistency to human behavior. Taken together, these results illustrate a use of unsupervised learning to provide a quantitative model of a multiarea cortical brain system and present a strong candidate for a biologically plausible computational theory of primate sensory learning.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Characterizing the Ventral Visual Stream with Response-Optimized Neural Encoding Models
    Khosla, Meenakshi
    Jamison, Keith
    Kuceyeski, Amy
    Sabuncu, Mert R.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [2] Fast ventral stream neural activity enables rapid visual categorization
    Cauchoix, Maxime
    Crouzet, Sebastien M.
    Fize, Denis
    Serre, Thomas
    NEUROIMAGE, 2016, 125 : 280 - 290
  • [3] Visual prototypes in the ventral stream are attuned to complexity and gaze behavior
    Rose, Olivia
    Johnson, James
    Wang, Binxu
    Ponce, Carlos R.
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [4] ABM and CNN Application in Ventral Stream of Visual System
    Yousefi, B.
    Yousefi, Paria
    2015 IEEE STUDENT SYMPOSIUM IN BIOMEDICAL ENGINEERING & SCIENCES (ISSBES), 2015, : 87 - 92
  • [5] Neural dynamics at successive stages of the ventral visual stream are consistent with hierarchical error signals
    Issa, Elias B.
    Cadieu, Charles F.
    DiCarlo, James J.
    ELIFE, 2018, 7
  • [6] LEARNING HIDDEN UNIT CONTRIBUTIONS FOR UNSUPERVISED SPEAKER ADAPTATION OF NEURAL NETWORK ACOUSTIC MODELS
    Swietojanski, Pawel
    Renals, Steve
    2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 171 - 176
  • [7] Neural "Ignition": Enhanced Activation Linked to Perceptual Awareness in Human Ventral Stream Visual Cortex
    Fisch, Lior
    Privman, Eran
    Ramot, Michal
    Harel, Michal
    Nir, Yuval
    Kipervasser, Svetlana
    Andelman, Fani
    Neufeld, Miri Y.
    Kramer, Uri
    Fried, Itzhak
    Malach, Rafael
    NEURON, 2009, 64 (04) : 562 - 574
  • [8] Deep Neural Networks Reveal a Gradient in the Complexity of Neural Representations across the Ventral Stream
    Guclu, Umut
    van Gerven, Marcel A. J.
    JOURNAL OF NEUROSCIENCE, 2015, 35 (27): : 10005 - 10014
  • [9] CSFL: A novel unsupervised convolution neural network approach for visual pattern classification
    Rehman, Sadaqat Ur
    Tu, Shanshan
    Huang, Yongfeng
    Liu, Guojie
    AI COMMUNICATIONS, 2017, 30 (05) : 311 - 324
  • [10] Crossmodal Recruitment of the Ventral Visual Stream in Congenital Blindness
    Ptito, Maurice
    Matteau, Isabelle
    Wang, Arthur Zhi
    Paulson, Olaf B.
    Siebner, Hartwig R.
    Kupers, Ron
    NEURAL PLASTICITY, 2012, 2012