Frequency Perception Network for Camouflaged Object Detection

被引:54
作者
Cong, Runmin [1 ]
Sun, Mengyao [2 ]
Zhang, Sanyi [3 ]
Zhou, Xiaofei [4 ]
Zhang, Wei [1 ]
Zhao, Yao [2 ]
机构
[1] Shandong Univ, Minist Educ, Sch Control Sci & Engn, Key Lab Machine Intelligence & Syst Control, Jinan, Shandong, Peoples R China
[2] Beijing Jiaotong Univ, Inst Informat Sci, Beijing Key Lab Adv Informat Sci & Network Techno, Beijing, Peoples R China
[3] Chinese Acad Sci, State Key Lab Informat Secur SKLOIS, Inst Informat Engn, Beijing, Peoples R China
[4] Hangzhou Dianzi Univ, Sch Automat, Hangzhou, Zhejiang, Peoples R China
来源
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年
基金
中国国家自然科学基金; 国家重点研发计划; 中国博士后科学基金;
关键词
Camouflaged object detection; Frequency perception; Coarse positioning stage; Fine localization stage;
D O I
10.1145/3581783.3612083
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Camouflaged object detection (COD) aims to accurately detect objects hidden in the surrounding environment. However, the existing COD methods mainly locate camouflaged objects in the RGB domain, their performance has not been fully exploited in many challenging scenarios. Considering that the features of the camouflaged object and the background are more discriminative in the frequency domain, we propose a novel learnable and separable frequency perception mechanism driven by the semantic hierarchy in the frequency domain. Our entire network adopts a two-stage model, including a frequency-guided coarse localization stage and a detail-preserving fine localization stage. With the multi-level features extracted by the backbone, we design a flexible frequency perception module based on octave convolution for coarse positioning. Then, we design the correction fusion module to step-by-step integrate the high-level features through the prior-guided correction and cross-layer feature channel association, and finally combine them with the shallow features to achieve the detailed correction of the camouflaged objects. Compared with the currently existing models, our proposed method achieves competitive performance in three popular benchmark datasets both qualitatively and quantitatively. The code will be released at https://github.com/rmcong/FPNet_ACMMM23.
引用
收藏
页码:1179 / 1189
页数:11
相关论文
共 61 条
[1]  
Bhajantri NU, 2006, ICIT 2006: 9TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, PROCEEDINGS, P145
[2]   Camouflaged Object Detection via Context-Aware Cross-Level Fusion [J].
Chen, Geng ;
Liu, Si-Jie ;
Sun, Yu-Jia ;
Ji, Ge-Peng ;
Wu, Ya-Feng ;
Zhou, Tao .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) :6981-6993
[3]   A weighted block cooperative sparse representation algorithm based on visual saliency dictionary [J].
Chen, Rui ;
Li, Fei ;
Tong, Ying ;
Wu, Minghu ;
Jiao, Yang .
CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023, 8 (01) :235-246
[4]   Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution [J].
Chen, Yunpeng ;
Fan, Haoqi ;
Xu, Bing ;
Yan, Zhicheng ;
Kalantidis, Yannis ;
Rohrbach, Marcus ;
Yan, Shuicheng ;
Feng, Jiashi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3434-3443
[5]  
Chen ZY, 2020, AAAI CONF ARTIF INTE, V34, P10599
[6]   Multi-Projection Fusion and Refinement Network for Salient Object Detection in 360° Omnidirectional Image [J].
Cong, Runmin ;
Huang, Ke ;
Lei, Jianjun ;
Zhao, Yao ;
Huang, Qingming ;
Kwong, Sam .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) :9495-9507
[7]   PSNet: Parallel Symmetric Network for Video Salient Object Detection [J].
Cong, Runmin ;
Song, Weiyu ;
Lei, Jianjun ;
Yue, Guanghui ;
Zhao, Yao ;
Kwong, Sam .
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 7 (02) :402-414
[8]   Boundary Guided Semantic Learning for Real-Time COVID-19 Lung Infection Segmentation System [J].
Cong, Runmin ;
Zhang, Yumo ;
Yang, Ning ;
Li, Haisheng ;
Zhang, Xueqi ;
Li, Ruochen ;
Chen, Zewen ;
Zhao, Yao ;
Kwong, Sam .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2022, 68 (04) :376-386
[9]   A Weakly Supervised Learning Framework for Salient Object Detection via Hybrid Labels [J].
Cong, Runmin ;
Qin, Qi ;
Zhang, Chen ;
Jiang, Qiuping ;
Wang, Shiqi ;
Zhao, Yao ;
Kwong, Sam .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (02) :534-548
[10]   CIR-Net: Cross-Modality Interaction and Refinement for RGB-D Salient Object Detection [J].
Cong, Runmin ;
Lin, Qinwei ;
Zhang, Chen ;
Li, Chongyi ;
Cao, Xiaochun ;
Huang, Qingming ;
Zhao, Yao .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 :6800-6815