Vision in the real world:: Finding, attending and recognizing objects

被引:19
作者
Bjoerkman, Marten [1 ]
Eklundh, Jan-Olof [1 ]
机构
[1] Royal Inst Technol, Comp Vis & Act Percept Lab, Stockholm, Sweden
关键词
robot vision; recognition; visual search; real-time vision;
D O I
10.1002/ima.20087
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper we discuss the notion of a "seeing" system that uses vision to interact with its environment. The requirements on such a system depend on the tasks it is involved in and should be evaluated with these in mind. Here we consider the task of finding and recognizing objects in the real world. After a discussion of the needed functionalities and issues about the design we present an integrated real-time vision system capable of finding, attending and recognizing objects in real settings. The system is based on a dual set of cameras, a wide field set for attention and a foveal one for recognition. The continuously running attentional process uses top-down object characteristics in terms of hue and 3D size. Recognition is performed with objects of interest foveated and segmented from its background. We describe the system structure as well as the different components in detail and present experimental evaluations of its overall performance. (C) 2007 Wiley Periodicals, Inc.
引用
收藏
页码:189 / 208
页数:20
相关论文
共 59 条
[1]   PERSPECTIVE APPROXIMATIONS [J].
ALOIMONOS, JY .
IMAGE AND VISION COMPUTING, 1990, 8 (03) :179-192
[2]   Shape indexing using approximate nearest-neighbour search in high-dimensional spaces [J].
Beis, JS ;
Lowe, DG .
1997 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1997, :1000-1006
[3]   The bas-relief ambiguity [J].
Belhumeur, PN ;
Kriegman, DJ ;
Yuille, AL .
1997 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1997, :1060-1066
[4]   Real-time epipolar geometry estimation of binocular stereo heads [J].
Björkman, M ;
Eklundh, JO .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (03) :425-432
[5]  
Björkman M, 2000, PROC CVPR IEEE, P506, DOI 10.1109/CVPR.2000.854897
[6]  
BJORKMAN M, 2002, THESIS ROYAL I TECHN
[7]  
BJORKMAN M, 2004, IEEE INT C ROB AUT A
[8]   SACCADE AND PURSUIT ON AN ACTIVE HEAD EYE PLATFORM [J].
BRADSHAW, KJ ;
MCLAUCHLAN, PF ;
REID, ID ;
MURRAY, DW .
IMAGE AND VISION COMPUTING, 1994, 12 (03) :155-163
[9]   FACE RECOGNITION - FEATURES VERSUS TEMPLATES [J].
BRUNELLI, R ;
POGGIO, T .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1993, 15 (10) :1042-1052
[10]   SMART SENSING WITHIN A PYRAMID VISION MACHINE [J].
BURT, PJ .
PROCEEDINGS OF THE IEEE, 1988, 76 (08) :1006-1015