Top-Down Visual Saliency via Joint CRF and Dictionary Learning

被引:99
|
作者
Yang, Jimei [1 ]
Yang, Ming-Hsuan [2 ]
机构
[1] Adobe Res, San Jose, CA 95110 USA
[2] Univ Calif Merced, Sch Engn, Merced, CA USA
基金
美国国家科学基金会;
关键词
Visual saliency; top-down visual saliency; fixation prediction; dictionary learning and conditional random fields; FEATURES; ATTENTION;
D O I
10.1109/TPAMI.2016.2547384
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Top-down visual saliency is an important module of visual attention. In this work, we propose a novel top-down saliency model that jointly learns a Conditional Random Field (CRF) and a visual dictionary. The proposed model incorporates a layered structure from top to bottom: CRF, sparse coding and image patches. With sparse coding as an intermediate layer, CRF is learned in a feature-adaptive manner; meanwhile with CRF as the output layer, the dictionary is learned under structured supervision. For efficient and effective joint learning, we develop a max-margin approach via a stochastic gradient descent algorithm. Experimental results on the Graz-02 and PASCAL VOC datasets show that our model performs favorably against state-of-the-art top-down saliency methods for target object localization. In addition, the dictionary update significantly improves the performance of our model. We demonstrate the merits of the proposed top-down saliency model by applying it to prioritizing object proposals for detection and predicting human fixations.
引用
收藏
页码:576 / 588
页数:13
相关论文
共 50 条
  • [1] Visual saliency detection via integrating bottom-up and top-down information
    Shariatmadar, Zahra Sadat
    Faez, Karim
    OPTIK, 2019, 178 : 1195 - 1207
  • [2] Top-down saliency detection driven by visual classification
    Murabito, Francesca
    Spampinato, Concetto
    Palazzo, Simone
    Giordano, Daniela
    Pogorelov, Konstantin
    Riegler, Michael
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 172 : 67 - 76
  • [3] Top-Down Saliency Detection via Contextual Pooling
    Zhu, Jun
    Qiu, Yuanyuan
    Zhang, Rui
    Huang, Jun
    Zhang, Wenjun
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2014, 74 (01): : 33 - 46
  • [4] Bottom-up saliency and top-down learning in the primary visual cortex of monkeys
    Yan, Yin
    Zhaoping, Li
    Li, Wu
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2018, 115 (41) : 10499 - 10504
  • [5] SUN: Top-down saliency using natural statistics
    Kanan, Christopher
    Tong, Mathew H.
    Zhang, Lingyun
    Cottrell, Garrison W.
    VISUAL COGNITION, 2009, 17 (6-7) : 979 - 1003
  • [6] Integrating bottom-up and top-down visual stimulus for saliency detection in news video
    Wu, Bo
    Xu, Linfeng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 73 (03) : 1053 - 1075
  • [7] Top-down Gamma Saliency - Learning to Search for Objects in Complex Scenes
    Burt, Ryan
    Principe, Jose C.
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [8] Boosting Bottom-up and Top-down Visual Features for Saliency Estimation
    Borji, Ali
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 438 - 445
  • [9] Combining Top-down and Bottom-up Visual Saliency for Firearms Localization
    Ardizzone, Edoardo
    Gallea, Roberto
    La Cascia, Marco
    Mazzola, Giuseppe
    2014 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS (SIGMAP), 2014, : 25 - 32
  • [10] A top-down saliency model with goal relevance
    Tanner, James
    Itti, Laurent
    JOURNAL OF VISION, 2019, 19 (01): : 1 - 16