A computational model for visual selection

Cited by: 101
Authors
Amit, Y [1]
Geman, D [2]
Affiliations
[1] Univ Chicago, Dept Stat, Chicago, IL 60637 USA
[2] Univ Massachusetts, Dept Math & Stat, Amherst, MA 01003 USA
DOI
10.1162/089976699300016197
Chinese Library Classification number
TP18 [Theory of artificial intelligence]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We propose a computational model for detecting and localizing instances from an object class in static gray-level images. We divide detection into visual selection and final classification, concentrating on the former: drastically reducing the number of candidate regions that require further, usually more intensive, processing, but with a minimum of computation and missed detections. Bottom-up processing is based on local groupings of edge fragments constrained by loose geometrical relationships. They have no a priori semantic or geometric interpretation. The role of training is to select special groupings that are moderately likely at certain places on the object but rare in the background. We show that the statistics in both populations are stable. The candidate regions are those that contain global arrangements of several local groupings. Whereas our model was not conceived to explain brain functions, it does cohere with evidence about the functions of neurons in V1 and V2, such as responses to coarse or incomplete patterns (e.g., illusory contours) and to scale and translation invariance in IT. Finally, the algorithm is applied to face and symbol detection.
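As a rough illustration of the selection idea described in the abstract, the Python sketch below keeps groupings that are moderately frequent on the object but rare in the background, then flags regions that contain several of them. It is not the authors' algorithm: the function names, the thresholds, and the boolean presence matrices are all assumptions made for this example, and the geometric arrangement constraint is reduced to a simple count.

```python
import numpy as np


def select_groupings(on_object, background,
                     min_object_rate=0.5, max_background_rate=0.05):
    """Return indices of groupings frequent on the object and rare elsewhere.

    on_object, background: boolean arrays of shape (n_samples, n_groupings),
    where entry [i, j] says whether grouping j was found in sample i.
    Both thresholds are hypothetical values chosen for illustration.
    """
    object_rate = np.asarray(on_object, dtype=bool).mean(axis=0)
    background_rate = np.asarray(background, dtype=bool).mean(axis=0)
    keep = (object_rate >= min_object_rate) & (background_rate <= max_background_rate)
    return np.flatnonzero(keep)


def candidate_regions(hits, selected, min_count=3):
    """Return indices of regions containing at least min_count selected groupings.

    hits: boolean array of shape (n_regions, n_groupings).
    Counting stands in for the geometric arrangement test (an assumption).
    """
    counts = np.asarray(hits, dtype=bool)[:, selected].sum(axis=1)
    return np.flatnonzero(counts >= min_count)
```

A region that passes this filter would then go on to the final classification stage that the abstract mentions; everything else is discarded cheaply.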
Pages: 1691-1715
Page count: 25