Object Recognition at Higher Regions of the Ventral Visual Stream via Dynamic Inference

被引:4
作者
Sorooshyari, Siamak K. [1 ]
Sheng, Huanjie [1 ,3 ]
Poor, H. Vincent [2 ]
机构
[1] Univ Calif Berkeley, Dept Integrat Biol, Berkeley, CA 94720 USA
[2] Princeton Univ, Dept Elect Engn, Princeton, NJ 08544 USA
[3] Roche Sequencing Solut Inc, Santa Clara, CA USA
基金
美国国家科学基金会;
关键词
object recognition; sequence estimation; decoding; IT cortex; dynamic inference; Viterbi algorithm; INFERIOR TEMPORAL CORTEX; NEURAL MECHANISMS; PYRAMIDAL NEURONS; CEREBRAL-CORTEX; ATTENTION; VISION; RESPONSES; SYSTEM; CELL; REPRESENTATION;
D O I
10.3389/fncom.2020.00046
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The ventral visual stream (VVS) is a fundamental pathway involved in visual object identification and recognition. In this work, we present a hypothesis of a sequence of computations performed by the VVS during object recognition. The operations performed by the inferior temporal (IT) cortex are represented as not being akin to a neural-network, but rather in-line with a dynamic inference instantiation of the untangling notion. The presentation draws upon a technique for dynamic maximum a posteriori probability (MAP) sequence estimation based on the Viterbi algorithm. Simulation results are presented to show that the decoding portion of the architecture that is associated with the IT can effectively untangle object identity when presented with synthetic data. More importantly, we take a step forward in visual neuroscience by presenting a framework for an inference-based approach that is biologically inspired via attributes implicated in primate object recognition. The analysis will provide insight in explaining the exceptional proficiency of the VVS.
引用
收藏
页数:22
相关论文
共 76 条
[1]  
Andrews K., 1997, TECHNICAL REPORT
[2]  
[Anonymous], 1965, PRINCIPLES COMMUNICA
[3]   Neural Mechanisms of Object-Based Attention [J].
Baldauf, Daniel ;
Desimone, Robert .
SCIENCE, 2014, 344 (6182) :424-427
[4]   RECOGNITION-BY-COMPONENTS - A THEORY OF HUMAN IMAGE UNDERSTANDING [J].
BIEDERMAN, I .
PSYCHOLOGICAL REVIEW, 1987, 94 (02) :115-147
[5]   Compression in Visual Working Memory: Using Statistical Regularities to Form More Efficient Memory Representations [J].
Brady, Timothy F. ;
Konkle, Talia ;
Alvarez, George A. .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 2009, 138 (04) :487-502
[6]   A model of V4 shape selectivity and invariance [J].
Cadieu, Charles ;
Kouh, Minjoon ;
Pasupathy, Anitha ;
Connor, Charles E. ;
Riesenhuber, Maximilian ;
Poggio, Tomaso .
JOURNAL OF NEUROPHYSIOLOGY, 2007, 98 (03) :1733-1750
[7]  
Chang JJ, 1997, IEEE T INFORM THEORY, V43, P1682, DOI 10.1109/18.623175
[8]   Responses of neurons in inferior temporal cortex during memory-guided visual search [J].
Chelazzi, L ;
Duncan, J ;
Miller, EK ;
Desimone, R .
JOURNAL OF NEUROPHYSIOLOGY, 1998, 80 (06) :2918-2940
[9]   THE WEIGHT SPECTRA OF SOME SHORT LOW-RATE CONVOLUTIONAL-CODES [J].
CONAN, J .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1984, 32 (09) :1050-1053
[10]   Attention during natural vision warps semantic representation across the human brain [J].
Cukur, Tolga ;
Nishimoto, Shinji ;
Huth, Alexander G. ;
Gallant, Jack L. .
NATURE NEUROSCIENCE, 2013, 16 (06) :763-+