Frequency Perception Network for Camouflaged Object Detection

被引:33
作者
Cong, Runmin [1 ]
Sun, Mengyao [2 ]
Zhang, Sanyi [3 ]
Zhou, Xiaofei [4 ]
Zhang, Wei [1 ]
Zhao, Yao [2 ]
机构
[1] Shandong Univ, Minist Educ, Sch Control Sci & Engn, Key Lab Machine Intelligence & Syst Control, Jinan, Shandong, Peoples R China
[2] Beijing Jiaotong Univ, Inst Informat Sci, Beijing Key Lab Adv Informat Sci & Network Techno, Beijing, Peoples R China
[3] Chinese Acad Sci, State Key Lab Informat Secur SKLOIS, Inst Informat Engn, Beijing, Peoples R China
[4] Hangzhou Dianzi Univ, Sch Automat, Hangzhou, Zhejiang, Peoples R China
来源
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年
基金
国家重点研发计划; 中国博士后科学基金; 中国国家自然科学基金;
关键词
Camouflaged object detection; Frequency perception; Coarse positioning stage; Fine localization stage;
D O I
10.1145/3581783.3612083
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Camouflaged object detection (COD) aims to accurately detect objects hidden in the surrounding environment. However, the existing COD methods mainly locate camouflaged objects in the RGB domain, their performance has not been fully exploited in many challenging scenarios. Considering that the features of the camouflaged object and the background are more discriminative in the frequency domain, we propose a novel learnable and separable frequency perception mechanism driven by the semantic hierarchy in the frequency domain. Our entire network adopts a two-stage model, including a frequency-guided coarse localization stage and a detail-preserving fine localization stage. With the multi-level features extracted by the backbone, we design a flexible frequency perception module based on octave convolution for coarse positioning. Then, we design the correction fusion module to step-by-step integrate the high-level features through the prior-guided correction and cross-layer feature channel association, and finally combine them with the shallow features to achieve the detailed correction of the camouflaged objects. Compared with the currently existing models, our proposed method achieves competitive performance in three popular benchmark datasets both qualitatively and quantitatively. The code will be released at https://github.com/rmcong/FPNet_ACMMM23.
引用
收藏
页码:1179 / 1189
页数:11
相关论文
共 61 条
  • [1] Bhajantri NU, 2006, ICIT 2006: 9TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, PROCEEDINGS, P145
  • [2] Camouflaged Object Detection via Context-Aware Cross-Level Fusion
    Chen, Geng
    Liu, Si-Jie
    Sun, Yu-Jia
    Ji, Ge-Peng
    Wu, Ya-Feng
    Zhou, Tao
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6981 - 6993
  • [3] A weighted block cooperative sparse representation algorithm based on visual saliency dictionary
    Chen, Rui
    Li, Fei
    Tong, Ying
    Wu, Minghu
    Jiao, Yang
    [J]. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023, 8 (01) : 235 - 246
  • [4] Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution
    Chen, Yunpeng
    Fan, Haoqi
    Xu, Bing
    Yan, Zhicheng
    Kalantidis, Yannis
    Rohrbach, Marcus
    Yan, Shuicheng
    Feng, Jiashi
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3434 - 3443
  • [5] Chen ZY, 2020, AAAI CONF ARTIF INTE, V34, P10599
  • [6] Chongyi Li, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12353), P225, DOI 10.1007/978-3-030-58598-3_14
  • [7] Multi-Projection Fusion and Refinement Network for Salient Object Detection in 360° Omnidirectional Image
    Cong, Runmin
    Huang, Ke
    Lei, Jianjun
    Zhao, Yao
    Huang, Qingming
    Kwong, Sam
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 9495 - 9507
  • [8] PSNet: Parallel Symmetric Network for Video Salient Object Detection
    Cong, Runmin
    Song, Weiyu
    Lei, Jianjun
    Yue, Guanghui
    Zhao, Yao
    Kwong, Sam
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 7 (02): : 402 - 414
  • [9] Boundary Guided Semantic Learning for Real-Time COVID-19 Lung Infection Segmentation System
    Cong, Runmin
    Zhang, Yumo
    Yang, Ning
    Li, Haisheng
    Zhang, Xueqi
    Li, Ruochen
    Chen, Zewen
    Zhao, Yao
    Kwong, Sam
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2022, 68 (04) : 376 - 386
  • [10] A Weakly Supervised Learning Framework for Salient Object Detection via Hybrid Labels
    Cong, Runmin
    Qin, Qi
    Zhang, Chen
    Jiang, Qiuping
    Wang, Shiqi
    Zhao, Yao
    Kwong, Sam
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (02) : 534 - 548