Unsupervised neural network models of the ventral visual stream

被引：171

作者：

Zhuang, Chengxu ^{[1
]}

Yan, Siming ^{[2
]}

Nayebi, Aran ^{[3
]}

Schrimpf, Martin ^{[4
]}

Frank, Michael C. ^{[1
]}

DiCarlo, James J. ^{[4
]}

Yamins, Daniel L. K. ^{[1
,5
,6
]}

机构：

[1] Stanford Univ, Dept Psychol, Stanford, CA 94305 USA

[2] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA

[3] Stanford Univ, Neurosci PhD Program, Stanford, CA 94305 USA

[4] MIT, Brain & Cognit Sci, 77 Massachusetts Ave, Cambridge, MA 02139 USA

[5] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA

[6] Stanford Univ, Wu Tsai Neurosci Inst, Stanford, CA 94305 USA

来源：

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA | 2021年 / 118卷 / 03期

基金：

美国国家科学基金会;

关键词：

ventral visual stream; deep neural networks; unsupervised algorithms; RECEPTIVE-FIELDS; AREA V4; RECOGNITION; INFANTS; INFORMATION; SELECTIVITY; FRAMEWORK; RESPONSES; FEATURES; PATHWAY;

D O I：

10.1073/pnas.2014196118

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Deep neural networks currently provide the best quantitative models of the response patterns of neurons throughout the primate ventral visual stream. However, such networks have remained implausible as a model of the development of the ventral stream, in part because they are trained with supervised methods requiring many more labels than are accessible to infants during development. Here, we report that recent rapid progress in unsupervised learning has largely closed this gap. We find that neural network models learned with deep unsupervised contrastive embedding methods achieve neural prediction accuracy in multiple ventral visual cortical areas that equals or exceeds that of models derived using today's best supervised methods and that the mapping of these neural network models' hidden layers is neuroanatomically consistent across the ventral stream. Strikingly, we find that these methods produce brainlike representations even when trained solely with real human child developmental data collected from head-mounted cameras, despite the fact that these datasets are noisy and limited. We also find that semisupervised deep contrastive embeddings can leverage small numbers of labeled examples to produce representations with substantially improved error-pattern consistency to human behavior. Taken together, these results illustrate a use of unsupervised learning to provide a quantitative model of a multiarea cortical brain system and present a strong candidate for a biologically plausible computational theory of primate sensory learning.

引用

页数：11

共 50 条

[31] Ventral and Dorsal Visual Stream Contributions to the Perception of Object Shape and Object Location
Zachariou, Valentinos
Klatzky, Roberta
Behrmann, Marlene
JOURNAL OF COGNITIVE NEUROSCIENCE, 2014, 26 (01) : 189 - 209
[32] Object Recognition at Higher Regions of the Ventral Visual Stream via Dynamic Inference
Sorooshyari, Siamak K.
Sheng, Huanjie
Poor, H. Vincent
FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2020, 14
[33] Spatial attention modulates visual gamma oscillations across the human ventral stream
Magazzini, Lorenzo
Singh, Krish D.
NEUROIMAGE, 2018, 166 : 219 - 229
[34] Neuronal Learning of Invariant Object Representation in the Ventral Visual Stream Is Not Dependent on Reward
Li, Nuo
DiCarlo, James J.
JOURNAL OF NEUROSCIENCE, 2012, 32 (19): : 6611 - 6620
[35] Perceptual processing in the ventral visual stream requires area TE but not rhinal cortex
Eldridge, Mark A. G.
Matsumoto, Narihisa
Wittig, John H. Jnr
Masseau, Evan C.
Saunders, Richard C.
Richmond, Barry J.
ELIFE, 2018, 7
[36] REM sleep behaviour disorder and visuoperceptive dysfunction: a disorder of the ventral visual stream?
Marques, Ana
Dujardin, Kathy
Boucart, Muriel
Pins, Delphine
Delliaux, Marie
Defebvre, Luc
Derambure, Philippe
Monaca, Christelle
JOURNAL OF NEUROLOGY, 2010, 257 (03) : 383 - 391
[37] Differential modulation of visual object processing in dorsal and ventral stream by stimulus visibility
Ludwig, Karin
Sterzer, Philipp
Kathmann, Norbert
Hesselmann, Guido
CORTEX, 2016, 83 : 113 - 123
[38] Unsupervised learning to detect loops using deep neural networks for visual SLAM system
Gao, Xiang
Zhang, Tao
AUTONOMOUS ROBOTS, 2017, 41 (01) : 1 - 18
[39] Metamers of the ventral stream
Freeman, Jeremy
Simoncelli, Eero P.
NATURE NEUROSCIENCE, 2011, 14 (09) : 1195 - U130
[40] Unwrapping the Ventral Stream
Freeman, Jeremy
Ziemba, Corey M.
JOURNAL OF NEUROSCIENCE, 2011, 31 (07): : 2349 - 2351

← 1 2 3 4 5 →