Learning bio-inspired head-centric representations of 3D shapes in an active fixation setting

被引:0
作者
Kalou, Katerina [1 ]
Sedda, Giulia [1 ]
Gibaldi, Agostino [2 ]
Sabatini, Silvio P. [1 ]
机构
[1] Univ Genoa, Dept Informat Bioengn Robot & Syst Engn, Genoa, Italy
[2] Univ Calif Berkeley, Sch Optometry, Berkeley, CA USA
来源
FRONTIERS IN ROBOTICS AND AI | 2022年 / 9卷
基金
美国国家卫生研究院;
关键词
stereopsis; vergent geometry; binocular disparity; active fixations; 3D shape perception; recurrent hierarchical networks; VISUAL-DISCRIMINATION; DISPARITY; DEPTH; ORIENTATION; VISION; SLANT; PERCEPTION; AMBLYOPIA; ALGORITHM; MODELS;
D O I
10.3389/frobt.2022.994284
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
When exploring the surrounding environment with the eyes, humans and primates need to interpret three-dimensional (3D) shapes in a fast and invariant way, exploiting a highly variant and gaze-dependent visual information. Since they have front-facing eyes, binocular disparity is a prominent cue for depth perception. Specifically, it serves as computational substrate for two ground mechanisms of binocular active vision: stereopsis and binocular coordination. To this aim, disparity information, which is expressed in a retinotopic reference frame, is combined along the visual cortical pathways with gaze information and transformed in a head-centric reference frame. Despite the importance of this mechanism, the underlying neural substrates still remain widely unknown. In this work, we investigate the capabilities of the human visual system to interpret the 3D scene exploiting disparity and gaze information. In a psychophysical experiment, human subjects were asked to judge the depth orientation of a planar surface either while fixating a target point or while freely exploring the surface. Moreover, we used the same stimuli to train a recurrent neural network to exploit the responses of a modelled population of cortical (V1) cells to interpret the 3D scene layout. The results for both human performance and from the model network show that integrating disparity information across gaze directions is crucial for a reliable and invariant interpretation of the 3D geometry of the scene.
引用
收藏
页数:15
相关论文
共 48 条
[11]   Neural encoding of binocular disparity: Energy models, position shifts and phase shifts [J].
Fleet, DJ ;
Wagner, H ;
Heeger, DJ .
VISION RESEARCH, 1996, 36 (12) :1839-1857
[12]   The Processing of Three-Dimensional Shape from Disparity in the Human Brain [J].
Georgieva, Svetlana ;
Peeters, Ronald ;
Kolster, Hauke ;
Todd, James T. ;
Orban, Guy A. .
JOURNAL OF NEUROSCIENCE, 2009, 29 (03) :727-742
[13]  
Gibaldi A., 2016, P MODVIS 2016 WORKSH
[14]   Crossed-uncrossed projections from primate retina are adapted to disparities of natural scenes [J].
Gibaldi, Agostino ;
Benson, Noah C. ;
Banks, Martin S. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2021, 118 (07)
[15]   Binocular Eye Movements Are Adapted to the Natural Environment [J].
Gibaldi, Agostino ;
Banks, Martin S. .
JOURNAL OF NEUROSCIENCE, 2019, 39 (15) :2877-2888
[16]   Evaluation of the Tobii EyeX Eye tracking controller and Matlab toolkit for research [J].
Gibaldi, Agostino ;
Vanegas, Mauricio ;
Bex, Peter J. ;
Maiello, Guido .
BEHAVIOR RESEARCH METHODS, 2017, 49 (03) :923-946
[17]   The Active Side of Stereopsis: Fixation Strategy and Adaptation to Natural Environments [J].
Gibaldi, Agostino ;
Canessa, Andrea ;
Sabatini, Silvio P. .
SCIENTIFIC REPORTS, 2017, 7
[18]   A cortical model for binocular vergence control without explicit calculation of disparity [J].
Gibaldi, Agostino ;
Chessa, Manuela ;
Canessa, Andrea ;
Sabatini, Silvio P. ;
Solari, Fabio .
NEUROCOMPUTING, 2010, 73 (7-9) :1065-1073
[19]   Cyclopean geometry of binocular vision [J].
Hansard, Miles ;
Horaud, Radu .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2008, 25 (09) :2357-2369
[20]   Three-dimensional orientation tuning in macaque area V4 [J].
Hinkle, DA ;
Connor, CE .
NATURE NEUROSCIENCE, 2002, 5 (07) :665-670