Top-Down Visual Saliency via Joint CRF and Dictionary Learning

被引：99

作者：

Yang, Jimei ^{[1
]}

Yang, Ming-Hsuan ^{[2
]}

机构：

[1] Adobe Res, San Jose, CA 95110 USA

[2] Univ Calif Merced, Sch Engn, Merced, CA USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2017年 / 39卷 / 03期

基金：

美国国家科学基金会;

关键词：

Visual saliency; top-down visual saliency; fixation prediction; dictionary learning and conditional random fields; FEATURES; ATTENTION;

D O I：

10.1109/TPAMI.2016.2547384

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Top-down visual saliency is an important module of visual attention. In this work, we propose a novel top-down saliency model that jointly learns a Conditional Random Field (CRF) and a visual dictionary. The proposed model incorporates a layered structure from top to bottom: CRF, sparse coding and image patches. With sparse coding as an intermediate layer, CRF is learned in a feature-adaptive manner; meanwhile with CRF as the output layer, the dictionary is learned under structured supervision. For efficient and effective joint learning, we develop a max-margin approach via a stochastic gradient descent algorithm. Experimental results on the Graz-02 and PASCAL VOC datasets show that our model performs favorably against state-of-the-art top-down saliency methods for target object localization. In addition, the dictionary update significantly improves the performance of our model. We demonstrate the merits of the proposed top-down saliency model by applying it to prioritizing object proposals for detection and predicting human fixations.

引用

页码：576 / 588

页数：13

共 50 条

[31] Statistical learning in the absence of explicit top-down attention
Duncan, Dock
Theeuwes, Jan
CORTEX, 2020, 131 : 54 - 65
[32] Neural mechanisms of top-down selection during visual search
Bichot, NP
PROCEEDINGS OF THE 23RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-4: BUILDING NEW BRIDGES AT THE FRONTIERS OF ENGINEERING AND MEDICINE, 2001, 23 : 780 - 783
[33] Visual working memory organization is subject to top-down control
van Lamsweerde, Amanda E.
Beck, Melissa R.
Johnson, Jeffrey S.
PSYCHONOMIC BULLETIN & REVIEW, 2016, 23 (04) : 1181 - 1189
[34] Top-down weighting of visual dimensions: Behavioral and electrophysiological evidence
Toellner, Thomas
Zehetleitner, Michael
Gramann, Klaus
Mueller, Hermann J.
VISION RESEARCH, 2010, 50 (14) : 1372 - 1381
[35] Top-down and bottom-up control of visual selection
Theeuwes, Jan
ACTA PSYCHOLOGICA, 2010, 135 (02) : 77 - 99
[36] Interaction between Bottom-up Saliency and Top-down Control: How Saliency Maps Are Created in the Human Brain
Melloni, Lucia
van Leeuwen, Sara
Alink, Arjen
Mueller, Notger G.
CEREBRAL CORTEX, 2012, 22 (12) : 2943 - 2952
[37] Modulating Bottom-Up and Top-Down Visual Processing via Language-Conditional Filters
Kesen, Ilker
Can, Ozan Arkan
Erdem, Erkut
Erdem, Aykut
Yuret, Deniz
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4609 - 4619
[38] Temporal learning of bottom-up connections via spatially nonspecific top-down inputs
Lee, Jung Hoon
Kim, Mean-Hwan
Vijayan, Sujith
NEUROCOMPUTING, 2020, 411 : 128 - 138
[39] Learning top-down gain control of feature selectivity in a recurrent network model of a visual cortical area
Schwabe, L
Obermayer, K
VISION RESEARCH, 2005, 45 (25-26) : 3202 - 3209
[40] Gastric Lymph nodes detection Based on Visual Saliency and Dictionary Learning
Tong, Nuo
Gou, Shuiping
Yao, Yao
Wang, Chenjiao
Bai, Jing
PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 2810 - 2813

← 1 2 3 4 5 →