RANSP: Ranking attention network for saliency prediction on omnidirectional images

被引:0
|
作者
Zhu, Dandan [1 ]
Chen, Yongqing [2 ]
Min, Xiongkuo [3 ]
Zhu, Yucheng [3 ]
Zhang, Guokai [4 ]
Zhou, Qiangqiang [5 ]
Zhai, Guangtao [3 ]
Yang, Xiaokang [1 ,3 ]
机构
[1] Shanghai Jiao Tong Univ, Key Lab Artificial Intelligence, Minist Educ, Shanghai 200240, Peoples R China
[2] Hainan Air Traff Management Sub Bureau, Haikou 570000, Hainan, Peoples R China
[3] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Shanghai 200240, Peoples R China
[4] Univ Shanghai Sci & Technol, Sch Opt Elect & Comp Engn, Shanghai 200093, Peoples R China
[5] Shanghai Business Sch, Sch Informat & Comp, Shanghai 201400, Peoples R China
基金
中国国家自然科学基金;
关键词
Omnidirectional images; Saliency prediction; Ranking attention; Part-guided attention; Channel-wise feature maps; VISUAL-ATTENTION; COLOR; MODEL;
D O I
10.1016/j.neucom.2021.06.029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Various convolutional neural network (CNN)-based methods have shown the ability to boost the performance of saliency prediction on omnidirectional images (ODIs). However, these methods are limited by sub-optimal accuracy, because not all the features extracted by the CNN model are useful for the final fine-grained saliency prediction. Some features are redundant and may have negative impact on the final fine-grained saliency prediction. To tackle this problem, we propose a novel Ranking Attention Network for saliency prediction (RANSP) of head fixations on ODIs. Specifically, the part-guided attention (PA) module and channel-wise feature (CF) extraction module are integrated in a unified framework and are trained in an end-to-end manner for fine-grained saliency prediction. To better utilize the channel-wise feature maps, we further propose a new Ranking Attention Module (RAM), which automatically ranks and selects these feature maps based on scores for fine-grained saliency prediction. Extensive experiments and ablation studies are conducted to show the effectiveness of our method for saliency prediction on ODIs. (c) 2021 Published by Elsevier B.V.
引用
收藏
页码:118 / 128
页数:11
相关论文
共 50 条
  • [41] Recent Advances in Saliency Estimation for Omnidirectional Images, Image Groups, and Video Sequences
    Buzzelli, Marco
    APPLIED SCIENCES-BASEL, 2020, 10 (15):
  • [42] Focusing Attention Network for Answer Ranking
    Xie, Yufei
    Liu, Shuchun
    Yao, Tangren
    Peng, Yao
    Lu, Zhao
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 3384 - 3390
  • [43] AUTOMATIC PREDICTION OF SALIENCY ON JPEG DISTORTED IMAGES
    Mittal, Anish
    Moorthy, Anush K.
    Bovik, Alan C.
    Cormack, Lawrence K.
    2011 THIRD INTERNATIONAL WORKSHOP ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2011, : 195 - 200
  • [44] Scanpath and saliency prediction on 360 degree images
    Assens, Marc
    Giro-i-Nieto, Xavier
    McGuinness, Kevin
    O'Connor, Noel E.
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 69 : 8 - 14
  • [45] Does text attract attention on e-commerce images: A novel saliency prediction dataset and method
    Jiang, Lai
    Li, Yifei
    Li, Shengxi
    Xu, Mai
    Lei, Se
    Guo, Yichen
    Huang, Bo
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2078 - 2087
  • [46] Regularized Random Walk Ranking for Co-Saliency Detection in images
    Bardhan, Sayanti
    Jacob, Shibu
    ELEVENTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING (ICVGIP 2018), 2018,
  • [47] Saliency Prediction Network for 360° Videos
    Zhang, Youqiang
    Dai, Feng
    Ma, Yike
    Li, Hongliang
    Zhao, Qiang
    Zhang, Yongdong
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2020, 14 (01) : 27 - 37
  • [48] A multiscale dilated dense convolutional network for saliency prediction with instance-level attention competition
    Li, Hao
    Qi, Fei
    Shi, Guangming
    Lin, Chunhuan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 64
  • [49] Transformer-based multi-level attention integration network for video saliency prediction
    Rui Tan
    Minghui Sun
    Yanhua Liang
    Multimedia Tools and Applications, 2025, 84 (13) : 11833 - 11854
  • [50] Attention-based contextual interaction asymmetric network for RGB-D saliency prediction
    Zhang, Xinyue
    Jin, Ting
    Zhou, Wujie
    Lei, Jingsheng
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74 (74)