Tag-Saliency: Combining bottom-up and top-down information for saliency detection

被引:30
作者
Zhu, Guokang [1 ,2 ]
Wang, Qi [1 ]
Yuan, Yuan [1 ]
机构
[1] Chinese Acad Sci, Xian Inst Opt & Precis Mech, State Key Lab Transient Opt & Photon, Ctr OPT IMagery Anal & Learning OPTIMAL, Xian 710119, Shaanxi, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian 710049, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Computer vision; Saliency detection; Visual attention; Image tagging; Visual media; Semantic; VISUAL-ATTENTION; RETRIEVAL; OBJECTS; COLOR; MODEL;
D O I
10.1016/j.cviu.2013.07.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the real world, people often have a habit tending to pay more attention to some things usually noteworthy, while ignore others. This phenomenon is associated with the top-down attention. Modeling this kind of attention has recently raised many interests in computer vision due to a wide range of practical applications. Majority of the existing models are based on eye-tracking or object detection. However, these methods may not apply to practical situations, because the eye movement data cannot be always recorded or there may be inscrutable objects to be handled in large-scale data sets. This paper proposes a Tag-Saliency model based on hierarchical image over-segmentation and auto-tagging, which can efficiently extract semantic information from large scale visual media data. Experimental results on a very challenging data set show that, the proposed Tag-Saliency model has the ability to locate the truly salient regions in a greater probability than other competitors. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:40 / 49
页数:10
相关论文
共 56 条
  • [1] Achanta R., IEEE INT C COMP VIS, P1597
  • [2] Achanta R., COMPUTER VISION SYST, P66
  • [3] Borji A., IEEE INT C COMP VIS, P1
  • [4] Bruce N.D.B., 2006, Advances in Neural Information Processing Systems, P155
  • [5] Visual causes versus correlates of attentional selection in dynamic scenes
    Carmi, Ran
    Itti, Laurent
    [J]. VISION RESEARCH, 2006, 46 (26) : 4333 - 4345
  • [6] Cerf M., ADV NEURAL INFORM PR, P241
  • [7] Global Contrast based Salient Region Detection
    Cheng, Ming-Ming
    Zhang, Guo-Xin
    Mitra, Niloy J.
    Huang, Xiaolei
    Hu, Shi-Min
    [J]. 2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 409 - 416
  • [8] Task-demands can immediately reverse the effects of sensory-driven saliency in complex visual stimuli
    Einhaeuser, Wolfgang
    Rutishauser, Ueli
    Koch, Christof
    [J]. JOURNAL OF VISION, 2008, 8 (02):
  • [9] Interesting objects are visually salient
    Elazary, Lior
    Itti, Laurent
    [J]. JOURNAL OF VISION, 2008, 8 (03):
  • [10] Efficient graph-based image segmentation
    Felzenszwalb, PF
    Huttenlocher, DP
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 59 (02) : 167 - 181