A Bayesian Framework for Active Artificial Perception

被引:36
作者
Ferreira, Joao Filipe [1 ,2 ]
Lobo, Jorge [1 ,2 ]
Bessiere, Pierre [3 ]
Castelo-Branco, Miguel [4 ]
Dias, Jorge [1 ,2 ,5 ]
机构
[1] Univ Coimbra, Inst Syst & Robot, P-3030290 Coimbra, Portugal
[2] Univ Coimbra, Fac Sci & Technol, P-3030290 Coimbra, Portugal
[3] Coll France, CNRS, F-75794 Paris 16, France
[4] Univ Coimbra, Biomed Inst Res Light & Image, Fac Med, P-3030290 Coimbra, Portugal
[5] Khalifa Univ Sci Technol & Res, Abu Dhabi 127788, U Arab Emirates
关键词
Active perception; Bayesian programming; biologically inspired robots; computer vision; multimodal perception; multisensory exploration; sensing and perception; sensor fusion; SOURCE LOCALIZATION; VISION; INTEGRATION;
D O I
10.1109/TSMCB.2012.2214477
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a Bayesian framework for the active multimodal perception of 3-D structure and motion. The design of this framework finds its inspiration in the role of the dorsal perceptual pathway of the human brain. Its composing models build upon a common egocentric spatial configuration that is naturally fitting for the integration of readings from multiple sensors using a Bayesian approach. In the process, we will contribute with efficient and robust probabilistic solutions for cyclopean geometry-based stereovision and auditory perception based only on binaural cues, modeled using a consistent formalization that allows their hierarchical use as building blocks for the multimodal sensor fusion framework. We will explicitly or implicitly address the most important challenges of sensor fusion using this framework, for vision, audition, and vestibular sensing. Moreover, interaction and navigation require maximal awareness of spatial surroundings, which, in turn, is obtained through active attentional and behavioral exploration of the environment. The computational models described in this paper will support the construction of a simultaneously flexible and powerful robotic implementation of multimodal active perception to be used in real-world applications, such as human-machine interaction or mobile robot navigation.
引用
收藏
页码:699 / 711
页数:13
相关论文
共 48 条
[1]   Active 3D Object Localization Using a Humanoid Robot [J].
Andreopoulos, Alexander ;
Hasler, Stephan ;
Wersing, Heiko ;
Janssen, Herbert ;
Tsotsos, John K. ;
Koerner, Edgar .
IEEE TRANSACTIONS ON ROBOTICS, 2011, 27 (01) :47-64
[2]   Neural representation of probabilistic information [J].
Barber, MJ ;
Clark, JW ;
Anderson, CH .
NEURAL COMPUTATION, 2003, 15 (08) :1843-1864
[3]  
Bessiere P, 2008, SPRINGER TRAC ADV RO, V46, P1
[4]   Structure and function of visual area MT [J].
Born, RT ;
Bradley, DC .
ANNUAL REVIEW OF NEUROSCIENCE, 2005, 28 :157-189
[5]  
Bouguet JY, Camera calibration toolbox for matlab
[6]   Active vision for sociable robots [J].
Breazeal, C ;
Edsinger, A ;
Fitzpatrick, P ;
Scassellati, B .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2001, 31 (05) :443-453
[7]  
Carpenter R.H. S., 2004, Advances in Clinical Neuroscience and Rehabilitation, V4, P6, DOI DOI 10.1016/J.CUB.2004.08.058
[8]   The time course of visual information accrual guiding eye movement decisions [J].
Caspi, A ;
Beutter, BR ;
Eckstein, MP .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (35) :13086-13090
[9]   Source localization and beamforming [J].
Chen, JC ;
Yao, K ;
Hudson, RE .
IEEE SIGNAL PROCESSING MAGAZINE, 2002, 19 (02) :30-39
[10]  
Dankers A, 2005, 2005 IEEE Intelligent Vehicles Symposium Proceedings, P187