Esaliency (Extended Saliency): Meaningful Attention Using Stochastic Image Modeling

Cited by: 95
Authors
Avraham, Tamar [1]
Lindenbaum, Michael [1]
Affiliation
[1] Technion Israel Inst Technol, Dept Comp Sci, IL-32000 Haifa, Israel
Keywords
Computer vision; scene analysis; similarity measures; performance evaluation of algorithms and systems; object recognition; visual search; attention; VISUAL-ATTENTION; SELECTIVE ATTENTION; SEARCH; SEGMENTATION; ORGANIZATION; NETWORKS;
DOI
10.1109/TPAMI.2009.53
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Computer vision attention processes assign variable-hypothesized importance to different parts of the visual input and direct the allocation of computational resources. This nonuniform allocation might help accelerate the image analysis process. This paper proposes a new bottom-up attention mechanism. Rather than taking the traditional approach, which tries to model human attention, we propose a validated stochastic model to estimate the probability that an image part is of interest. We refer to this probability as saliency and thus specify saliency in a mathematically well-defined sense. The model quantifies several intuitive observations, such as the greater likelihood of correspondence between visually similar image regions and the likelihood that only a few interesting objects will be present in the scene. The latter observation, which implies that such objects are (relaxed) global exceptions, replaces the traditional preference for local contrast. The algorithm starts with a rough preattentive segmentation and then uses a graphical model approximation to efficiently reveal which segments are more likely to be of interest. Experiments on natural scenes containing a variety of objects demonstrate the proposed method and show its advantages over previous approaches.
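The two observations named in the abstract (visually similar regions tend to share a label, and only a few regions are of interest) can be illustrated with a toy calculation. The sketch below is not the authors' model or algorithm: the function name `toy_saliency`, the scalar features, the independent prior, the exponential similarity penalty, and the brute-force enumeration (in place of the graphical model approximation the abstract mentions) are all assumptions chosen only to show how a "global exception" can end up with the highest probability of being of interest.

```python
from itertools import product
import math


def toy_saliency(features, expected_targets=1.0, similarity_scale=1.0):
    """Toy marginals P(segment i is of interest) by exhaustive enumeration.

    Hypothetical illustration only; every modeling choice here is an assumption.
    features: one scalar descriptor per segment (assumed input format).
    """
    n = len(features)
    p = expected_targets / n  # small prior: few segments are of interest
    marginals = [0.0] * n
    total = 0.0
    for labels in product((0, 1), repeat=n):
        # Independent prior over labels (assumption).
        weight = 1.0
        for label in labels:
            weight *= p if label else (1.0 - p)
        # Similar segments are unlikely to carry different labels (assumption):
        # the penalty factor tends to 0 as the feature distance tends to 0.
        for i in range(n):
            for j in range(i + 1, n):
                if labels[i] != labels[j]:
                    dist = abs(features[i] - features[j])
                    weight *= 1.0 - math.exp(-dist / similarity_scale)
        total += weight
        for i, label in enumerate(labels):
            if label:
                marginals[i] += weight
    return [m / total for m in marginals]


# Four mutually similar segments and one outlier.
scores = toy_saliency([0.0, 0.1, 0.05, 0.0, 5.0])
most_salient = max(range(len(scores)), key=lambda i: scores[i])
```

Under these assumptions the outlier segment (index 4) receives the highest score, because labeling it alone as interesting conflicts with no similar neighbor. The enumeration is exponential in the number of segments, so it is suitable only for a handful of segments.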
Pages: 693-708
Number of pages: 16