Object recognition and segmentation by a fragment-based hierarchy

被引：179

作者：

Ullman, Shimon ^{[1
]}

机构：

[1] Weizmann Inst Sci, Dept Comp Sci & Appl Math, IL-76100 Rehovot, Israel

来源：

TRENDS IN COGNITIVE SCIENCES | 2007年 / 11卷 / 02期

关键词：

VISUAL CATEGORIZATION; TEMPORAL CORTEX; SHAPE; REPRESENTATION; INVARIANCE; ORGANIZATION; SELECTIVITY; EXPERIENCE; FEATURES; MONKEYS;

D O I：

10.1016/j.tics.2006.11.009

中图分类号：

B84 [心理学]; C [社会科学总论]; Q98 [人类学];

学科分类号：

03 ; 0303 ; 030303 ; 04 ; 0402 ;

摘要：

How do we learn to recognize visual categories, such as dogs and cats? Somehow, the brain uses limited variable examples to extract the essential characteristics of new visual categories. Here, I describe an approach to category learning and recognition that is based on recent computational advances. In this approach, objects are represented by a hierarchy of fragments that are extracted during learning from observed examples. The fragments are class-specific features and are selected to deliver a high amount of information for categorization. The same fragments hierarchy is then used for general categorization, individual object recognition and object-parts identification. Recognition is also combined with object segmentation, using stored fragments, to provide a top-down process that delineates object boundaries in complex cluttered scenes. The approach is computationally effective and provides a possible framework for categorization, recognition and segmentation in human vision.

引用

页码：58 / 64

页数：7

共 49 条

[1] Impact of learning on representation of parts and wholes in monkey inferotemporal cortex [J].

Baker, CI ;

Behrmann, M ;

Olson, CR .

NATURE NEUROSCIENCE, 2002, 5 (11) :1210-1216

[2]

Bart E, 2005, PROC CVPR IEEE, P672

[3]

Bart E, 2004, LECT NOTES COMPUT SC, V3022, P152

[4] AN INFORMATION MAXIMIZATION APPROACH TO BLIND SEPARATION AND BLIND DECONVOLUTION [J].

BELL, AJ ;

SEJNOWSKI, TJ .

NEURAL COMPUTATION, 1995, 7 (06) :1129-1159

[5] HUMAN IMAGE UNDERSTANDING - RECENT RESEARCH AND A THEORY [J].

BIEDERMAN, I .

COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1985, 32 (01) :29-73

[6]

Borenstein E, 2004, LECT NOTES COMPUT SC, V3023, P315

[7]

Borenstein E, 2002, LECT NOTES COMPUT SC, V2351, P109

[8]

BORENSTEIN E, 2004, IEEE C COMP VIS PATT, P46

[9] Bootstrapped learning of novel objects [J].

Brady, MJ ;

Kersten, D .

JOURNAL OF VISION, 2003, 3 (06) :413-422

[10] Underlying principles of visual shape selectivity in posterior inferotemporal cortex [J].

Brincat, SL ;

Connor, CE .

NATURE NEUROSCIENCE, 2004, 7 (08) :880-886

← 1 2 3 4 5 →