Top-Down Visual Saliency via Joint CRF and Dictionary Learning

被引:99
|
作者
Yang, Jimei [1 ]
Yang, Ming-Hsuan [2 ]
机构
[1] Adobe Res, San Jose, CA 95110 USA
[2] Univ Calif Merced, Sch Engn, Merced, CA USA
基金
美国国家科学基金会;
关键词
Visual saliency; top-down visual saliency; fixation prediction; dictionary learning and conditional random fields; FEATURES; ATTENTION;
D O I
10.1109/TPAMI.2016.2547384
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Top-down visual saliency is an important module of visual attention. In this work, we propose a novel top-down saliency model that jointly learns a Conditional Random Field (CRF) and a visual dictionary. The proposed model incorporates a layered structure from top to bottom: CRF, sparse coding and image patches. With sparse coding as an intermediate layer, CRF is learned in a feature-adaptive manner; meanwhile with CRF as the output layer, the dictionary is learned under structured supervision. For efficient and effective joint learning, we develop a max-margin approach via a stochastic gradient descent algorithm. Experimental results on the Graz-02 and PASCAL VOC datasets show that our model performs favorably against state-of-the-art top-down saliency methods for target object localization. In addition, the dictionary update significantly improves the performance of our model. We demonstrate the merits of the proposed top-down saliency model by applying it to prioritizing object proposals for detection and predicting human fixations.
引用
收藏
页码:576 / 588
页数:13
相关论文
共 50 条
  • [31] Statistical learning in the absence of explicit top-down attention
    Duncan, Dock
    Theeuwes, Jan
    CORTEX, 2020, 131 : 54 - 65
  • [32] Neural mechanisms of top-down selection during visual search
    Bichot, NP
    PROCEEDINGS OF THE 23RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-4: BUILDING NEW BRIDGES AT THE FRONTIERS OF ENGINEERING AND MEDICINE, 2001, 23 : 780 - 783
  • [33] Visual working memory organization is subject to top-down control
    van Lamsweerde, Amanda E.
    Beck, Melissa R.
    Johnson, Jeffrey S.
    PSYCHONOMIC BULLETIN & REVIEW, 2016, 23 (04) : 1181 - 1189
  • [34] Top-down weighting of visual dimensions: Behavioral and electrophysiological evidence
    Toellner, Thomas
    Zehetleitner, Michael
    Gramann, Klaus
    Mueller, Hermann J.
    VISION RESEARCH, 2010, 50 (14) : 1372 - 1381
  • [35] Top-down and bottom-up control of visual selection
    Theeuwes, Jan
    ACTA PSYCHOLOGICA, 2010, 135 (02) : 77 - 99
  • [36] Interaction between Bottom-up Saliency and Top-down Control: How Saliency Maps Are Created in the Human Brain
    Melloni, Lucia
    van Leeuwen, Sara
    Alink, Arjen
    Mueller, Notger G.
    CEREBRAL CORTEX, 2012, 22 (12) : 2943 - 2952
  • [37] Modulating Bottom-Up and Top-Down Visual Processing via Language-Conditional Filters
    Kesen, Ilker
    Can, Ozan Arkan
    Erdem, Erkut
    Erdem, Aykut
    Yuret, Deniz
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4609 - 4619
  • [38] Temporal learning of bottom-up connections via spatially nonspecific top-down inputs
    Lee, Jung Hoon
    Kim, Mean-Hwan
    Vijayan, Sujith
    NEUROCOMPUTING, 2020, 411 : 128 - 138
  • [39] Learning top-down gain control of feature selectivity in a recurrent network model of a visual cortical area
    Schwabe, L
    Obermayer, K
    VISION RESEARCH, 2005, 45 (25-26) : 3202 - 3209
  • [40] Gastric Lymph nodes detection Based on Visual Saliency and Dictionary Learning
    Tong, Nuo
    Gou, Shuiping
    Yao, Yao
    Wang, Chenjiao
    Bai, Jing
    PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 2810 - 2813