Top-Down Visual Saliency via Joint CRF and Dictionary Learning

被引:99
|
作者
Yang, Jimei [1 ]
Yang, Ming-Hsuan [2 ]
机构
[1] Adobe Res, San Jose, CA 95110 USA
[2] Univ Calif Merced, Sch Engn, Merced, CA USA
基金
美国国家科学基金会;
关键词
Visual saliency; top-down visual saliency; fixation prediction; dictionary learning and conditional random fields; FEATURES; ATTENTION;
D O I
10.1109/TPAMI.2016.2547384
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Top-down visual saliency is an important module of visual attention. In this work, we propose a novel top-down saliency model that jointly learns a Conditional Random Field (CRF) and a visual dictionary. The proposed model incorporates a layered structure from top to bottom: CRF, sparse coding and image patches. With sparse coding as an intermediate layer, CRF is learned in a feature-adaptive manner; meanwhile with CRF as the output layer, the dictionary is learned under structured supervision. For efficient and effective joint learning, we develop a max-margin approach via a stochastic gradient descent algorithm. Experimental results on the Graz-02 and PASCAL VOC datasets show that our model performs favorably against state-of-the-art top-down saliency methods for target object localization. In addition, the dictionary update significantly improves the performance of our model. We demonstrate the merits of the proposed top-down saliency model by applying it to prioritizing object proposals for detection and predicting human fixations.
引用
收藏
页码:576 / 588
页数:13
相关论文
共 50 条
  • [41] Top-down and bottom-up competition in visual stimuli processing
    Ligeza, Tomasz S.
    Tymorek, Agnieszka D.
    Wyczesany, Miroslaw
    ACTA NEUROBIOLOGIAE EXPERIMENTALIS, 2017, 77 (04) : 305 - 316
  • [42] Distinct modes of top-down cognitive processing in the ventral visual cortex
    Jo, Han-Gue
    Kellermann, Thilo
    Baumann, Conrad
    Ito, Junji
    Holthausen, Barbara Schulte
    Schneider, Frank
    Gruen, Sonja
    Habel, Ute
    NEUROIMAGE, 2019, 193 : 201 - 213
  • [44] Top-Down but Not Bottom-Up Visual Scanning is Affected in Hereditary Pure Cerebellar Ataxia
    Matsuda, Shunichi
    Matsumoto, Hideyuki
    Furubayashi, Toshiaki
    Fukuda, Hideki
    Emoto, Masaki
    Hanajima, Ritsuko
    Tsuji, Shoji
    Ugawa, Yoshikazu
    Terao, Yasuo
    PLOS ONE, 2014, 9 (12):
  • [45] Attentional activation of the visual thalamic reticular nucleus depends on 'top-down' inputs from the primary visual cortex via corticogeniculate pathways
    Montero, VM
    BRAIN RESEARCH, 2000, 864 (01) : 95 - 104
  • [46] A model of top-down attentional control during visual search in complex scenes
    Hwang, Alex D.
    Higgins, Emily C.
    Pomplun, Marc
    JOURNAL OF VISION, 2009, 9 (05):
  • [47] Visual crowding involves delayed frontoparietal response and enhanced top-down modulation
    Han, Qiming
    Luo, Huan
    EUROPEAN JOURNAL OF NEUROSCIENCE, 2019, 50 (06) : 2931 - 2941
  • [48] Investigating task-dependent top-down effects on overt visual attention
    Betz, Torsten
    Kietzmann, Tim C.
    Wilming, Niklas
    Koenig, Peter
    JOURNAL OF VISION, 2010, 10 (03): : 1 - 14
  • [49] Top-down modulation of DLPFC in visual search: a study based on fMRI and TMS
    Tian, Yin
    Tan, Congming
    Tan, Jianling
    Yang, Li
    Tang, Yi
    CEREBRAL CORTEX, 2024, 34 (02)
  • [50] The importance of being expert: Top-down attentional control in visual search with photographs
    Hershler, Orit
    Hochstein, Shaul
    ATTENTION PERCEPTION & PSYCHOPHYSICS, 2009, 71 (07) : 1478 - 1486