Top-Down Visual Saliency via Joint CRF and Dictionary Learning

Cited by: 99

Authors:

Yang, Jimei [1]

Yang, Ming-Hsuan [2]

Affiliations:

[1] Adobe Res, San Jose, CA 95110 USA

[2] Univ Calif Merced, Sch Engn, Merced, CA USA

Source:

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2017, Vol. 39, No. 3

Funding:

US National Science Foundation;

Keywords:

Visual saliency; top-down visual saliency; fixation prediction; dictionary learning and conditional random fields; FEATURES; ATTENTION;

DOI:

10.1109/TPAMI.2016.2547384

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Top-down visual saliency is an important module of visual attention. In this work, we propose a novel top-down saliency model that jointly learns a Conditional Random Field (CRF) and a visual dictionary. The proposed model incorporates a layered structure from top to bottom: CRF, sparse coding and image patches. With sparse coding as an intermediate layer, CRF is learned in a feature-adaptive manner; meanwhile with CRF as the output layer, the dictionary is learned under structured supervision. For efficient and effective joint learning, we develop a max-margin approach via a stochastic gradient descent algorithm. Experimental results on the Graz-02 and PASCAL VOC datasets show that our model performs favorably against state-of-the-art top-down saliency methods for target object localization. In addition, the dictionary update significantly improves the performance of our model. We demonstrate the merits of the proposed top-down saliency model by applying it to prioritizing object proposals for detection and predicting human fixations.
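The layered idea in the abstract (image patches, then sparse codes, then a classifier trained with a max-margin objective by stochastic gradient descent) can be illustrated with a rough sketch. This is not the authors' implementation: it uses ISTA as a generic stand-in sparse coder, scores each patch independently with a linear weight vector, and leaves out the CRF pairwise terms and the dictionary update entirely. The function names `sparse_code` and `hinge_sgd_step` and all parameter values are hypothetical.

```python
import numpy as np

def soft_threshold(x, t):
    # Proximal operator of the L1 norm: shrink each entry toward zero by t.
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

def sparse_code(D, x, lam=0.1, n_iter=200):
    """Approximate argmin_s 0.5*||x - D s||^2 + lam*||s||_1 with ISTA.

    D: (d, k) dictionary with one atom per column (assumed layout).
    x: (d,) patch feature vector.
    """
    # Step size 1/L, where L is the Lipschitz constant of the smooth term.
    L = np.linalg.norm(D, 2) ** 2
    s = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ s - x)
        s = soft_threshold(s - grad / L, lam / L)
    return s

def hinge_sgd_step(w, s, y, lr=0.1, reg=1e-3):
    """One SGD step on max(0, 1 - y * w.s) + (reg/2)*||w||^2, with y in {-1, +1}.

    Only the node (unary) weights are updated; a full CRF would also
    carry pairwise terms between neighbouring patches.
    """
    grad = reg * w
    if y * (w @ s) < 1.0:  # margin violated, so the hinge term is active
        grad = grad - y * s
    return w - lr * grad
```

In a full pipeline, the score `w @ sparse_code(D, x)` would serve as a per-patch saliency evidence term; the joint model described in the abstract additionally updates the dictionary under the structured loss, which this sketch does not attempt.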


Pages: 576-588

Page count: 13
