Visual Saliency Based on Scale-Space Analysis in the Frequency Domain

被引:449
|
作者
Li, Jian [1 ]
Levine, Martin D. [2 ,3 ]
An, Xiangjing [1 ]
Xu, Xin [1 ]
He, Hangen [1 ]
机构
[1] Natl Univ Def Technol, Inst Automat, Changsha 410073, Hunan, Peoples R China
[2] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ H3A 2A7, Canada
[3] McGill Univ, CIM, Montreal, PQ H3A 2A7, Canada
基金
中国国家自然科学基金;
关键词
Visual attention; saliency; hypercomplex Fourier transform; eye tracking; scale space analysis; ATTENTION; IMAGE; MODEL; SEARCH;
D O I
10.1109/TPAMI.2012.147
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We address the issue of visual saliency from three perspectives. First, we consider saliency detection as a frequency domain analysis problem. Second, we achieve this by employing the concept of nonsaliency. Third, we simultaneously consider the detection of salient regions of different size. The paper proposes a new bottom-up paradigm for detecting visual saliency, characterized by a scale-space analysis of the amplitude spectrum of natural images. We show that the convolution of the image amplitude spectrum with a low-pass Gaussian kernel of an appropriate scale is equivalent to an image saliency detector. The saliency map is obtained by reconstructing the 2D signal using the original phase and the amplitude spectrum, filtered at a scale selected by minimizing saliency map entropy. A Hypercomplex Fourier Transform performs the analysis in the frequency domain. Using available databases, we demonstrate experimentally that the proposed model can predict human fixation data. We also introduce a new image database and use it to show that the saliency detector can highlight both small and large salient regions, as well as inhibit repeated distractors in cluttered images. In addition, we show that it is able to predict salient regions on which people focus their attention.
引用
收藏
页码:996 / 1010
页数:15
相关论文
共 50 条
  • [1] Fusion of infrared and visible images based on saliency scale-space in frequency domain
    Chen, Yanfei
    Sang, Nong
    Dan, Zhiping
    MIPPR 2015: PATTERN RECOGNITION AND COMPUTER VISION, 2015, 9813
  • [2] Modified Scale-Space Analysis in Frequency Domain Based on Adaptive Multiscale Gaussian Filter for Saliency Detection
    Jaemsiri, Jenjira
    Titijaroonroj, Taravichet
    Rungrattanaubol, Jaratsri
    2019 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE 2019), 2019, : 218 - 223
  • [3] Visual saliency detection: From space to frequency
    Chen, Dongyue
    Jia, Tong
    Wu, Chengdong
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2016, 44 : 57 - 68
  • [4] Saliency Detection Based on Frequency and Spatial Domain Analysis
    Li, Jian
    Levine, Martin D.
    An, Xiangjing
    He, Hangen
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
  • [5] Saliency Detection Method Based on Multiscale Analysis in Frequency Domain
    Wu Q.
    Yu Y.
    Yang J.
    Shao K.
    Kang Y.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2020, 32 (01): : 68 - 78
  • [6] Finding Regions of Interest Based on Scale-Space Keypoint Detection
    Zeng, Ming
    Yang, Ting
    Li, Youfu
    Meng, Qinghao
    Liu, Jian
    Han, Tiemao
    ADVANCES IN COMPUTER SCIENCE AND EDUCATION APPLICATIONS, PT II, 2011, 202 : 428 - +
  • [7] Scale-Invariant Amplitude Spectrum Modulation for Visual Saliency Detection
    Chen, Dongyue
    Chu, Hao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (08) : 1206 - 1214
  • [8] Visual Saliency Detection Based on color Frequency Features under Bayesian framework
    Ayoub, Naeem
    Gao, Zhenguo
    Chen, Danjie
    Tobji, Rachida
    Yao, Nianmin
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (02): : 676 - 692
  • [9] Principal Component Analysis-Based Visual Saliency Detection
    Yang, Bing
    Zhang, Xiaoyun
    Chen, Li
    Gao, Zhiyong
    IEEE TRANSACTIONS ON BROADCASTING, 2016, 62 (04) : 842 - 854
  • [10] On the Gaussian scale-space
    Iijima, T
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (07) : 1162 - 1164