Object recognition and segmentation by a fragment-based hierarchy

被引:179
作者
Ullman, Shimon [1 ]
机构
[1] Weizmann Inst Sci, Dept Comp Sci & Appl Math, IL-76100 Rehovot, Israel
关键词
VISUAL CATEGORIZATION; TEMPORAL CORTEX; SHAPE; REPRESENTATION; INVARIANCE; ORGANIZATION; SELECTIVITY; EXPERIENCE; FEATURES; MONKEYS;
D O I
10.1016/j.tics.2006.11.009
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
How do we learn to recognize visual categories, such as dogs and cats? Somehow, the brain uses limited variable examples to extract the essential characteristics of new visual categories. Here, I describe an approach to category learning and recognition that is based on recent computational advances. In this approach, objects are represented by a hierarchy of fragments that are extracted during learning from observed examples. The fragments are class-specific features and are selected to deliver a high amount of information for categorization. The same fragments hierarchy is then used for general categorization, individual object recognition and object-parts identification. Recognition is also combined with object segmentation, using stored fragments, to provide a top-down process that delineates object boundaries in complex cluttered scenes. The approach is computationally effective and provides a possible framework for categorization, recognition and segmentation in human vision.
引用
收藏
页码:58 / 64
页数:7
相关论文
共 49 条
[1]   Impact of learning on representation of parts and wholes in monkey inferotemporal cortex [J].
Baker, CI ;
Behrmann, M ;
Olson, CR .
NATURE NEUROSCIENCE, 2002, 5 (11) :1210-1216
[2]  
Bart E, 2005, PROC CVPR IEEE, P672
[3]  
Bart E, 2004, LECT NOTES COMPUT SC, V3022, P152
[4]   AN INFORMATION MAXIMIZATION APPROACH TO BLIND SEPARATION AND BLIND DECONVOLUTION [J].
BELL, AJ ;
SEJNOWSKI, TJ .
NEURAL COMPUTATION, 1995, 7 (06) :1129-1159
[5]   HUMAN IMAGE UNDERSTANDING - RECENT RESEARCH AND A THEORY [J].
BIEDERMAN, I .
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1985, 32 (01) :29-73
[6]  
Borenstein E, 2004, LECT NOTES COMPUT SC, V3023, P315
[7]  
Borenstein E, 2002, LECT NOTES COMPUT SC, V2351, P109
[8]  
BORENSTEIN E, 2004, IEEE C COMP VIS PATT, P46
[9]   Bootstrapped learning of novel objects [J].
Brady, MJ ;
Kersten, D .
JOURNAL OF VISION, 2003, 3 (06) :413-422
[10]   Underlying principles of visual shape selectivity in posterior inferotemporal cortex [J].
Brincat, SL ;
Connor, CE .
NATURE NEUROSCIENCE, 2004, 7 (08) :880-886