Decoupled dynamic group equivariant filter for saliency prediction on omnidirectional image

被引:1
作者
Zhu, Dandan [1 ,2 ]
Zhang, Kaiwei [3 ]
Zhang, Guokai [4 ]
Zhou, Qiangqiang [5 ]
Min, Xiongkuo [3 ]
Zhai, Guangtao [3 ]
Yang, Xiaokang [2 ]
机构
[1] Donghua Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, Key Lab Artificial Intelligence, Minist Educ, Shanghai 200240, Peoples R China
[3] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Shanghai 200240, Peoples R China
[4] Univ Shanghai Sci & Technol, Sch Opt Elect & Comp Engn, Shanghai 200093, Peoples R China
[5] Jiangxi Normal Univ, Sch Software, Nanchang 330022, Peoples R China
基金
中国国家自然科学基金;
关键词
Omnidirectional image; Saliency prediction; Decoupled dynamic group equivariant filter; Content; -adaptive; Lightweight model; MODEL;
D O I
10.1016/j.neucom.2022.09.107
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current saliency prediction models based on convolutional neural networks (CNNs) achieve solid improvement in predicting human attention on omnidirectional image (ODI). However, these models that employ standard convolution have two main shortcomings: content-agnostic and computation -intensive. To address these two shortcomings, we propose a decoupled dynamic group equivariant filter (DDGF). Specifically, inspired by the attention mechanism that adopts light-weight branches for estimat-ing spatial and channel attention, we decouple group equivariant convolution (i.e. p4 convolution) into spatial and channel dynamic group equivariant filters. Such a design not only makes p4 convolution filter adaptive to ODI content, but also considerably reduces computational cost. To our best knowledge, the DDGF is the first decoupled dynamic convolution filter that applied to the task of saliency prediction. Meanwhile, we observe that it is effective and efficient when replacing standard group equivariant con-volution with DDGF in ODI saliency prediction. Experimental results show that the proposed DDGF can achieve superior performance in comparison with other state-of-the-art methods. Additionally, we con-duct ablation experiments to verify the effectiveness of each component of the proposed DDGF.CO 2022 Published by Elsevier B.V.
引用
收藏
页码:111 / 121
页数:11
相关论文
共 60 条
[1]   SaltiNet: Scan-path Prediction on 360 Degree Images using Saliency Volumes [J].
Assens, Marc ;
Giro-i-Nieto, Xavier ;
McGuinness, Kevin ;
O'Connor, Noel E. .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, :2331-2338
[2]   Large-Scale Machine Learning with Stochastic Gradient Descent [J].
Bottou, Leon .
COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, :177-186
[3]   Recent Advances in Saliency Estimation for Omnidirectional Images, Image Groups, and Video Sequences [J].
Buzzelli, Marco .
APPLIED SCIENCES-BASEL, 2020, 10 (15)
[4]   What Do Different Evaluation Metrics Tell Us About Saliency Models? [J].
Bylinskii, Zoya ;
Judd, Tilke ;
Oliva, Aude ;
Torralba, Antonio ;
Durand, Fredo .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (03) :740-757
[5]  
Chao FY, 2018, IEEE INT CONF MULTI
[6]  
Chen Y., 2021, 2021 IEEE INT C MULT, P1
[7]   DPANet: Depth Potentiality-Aware Gated Attention Network for RGB-D Salient Object Detection [J].
Chen, Zuyao ;
Cong, Runmin ;
Xu, Qianqian ;
Huang, Qingming .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :7012-7024
[8]  
Cohen TS, 2016, PR MACH LEARN RES, V48
[9]   Predicting Human Eye Fixations via an LSTM-Based Saliency Attentive Model [J].
Cornia, Marcella ;
Baraldi, Lorenzo ;
Serra, Giuseppe ;
Cucchiara, Rita .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (10) :5142-5154
[10]  
Cornia M, 2016, INT C PATT RECOG, P3488, DOI 10.1109/ICPR.2016.7900174