Promoting camouflaged object detection through novel edge-target interaction and frequency-spatial fusion

被引:0
作者
Guan, Juwei [1 ]
Qian, Weiqi [1 ]
Zhu, Tongxin [1 ]
Fang, Xiaolin [1 ,2 ]
机构
[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing 211189, Peoples R China
[2] Southeast Univ, Key Lab Comp Network & Informat Integrat, Minist Educ, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Camouflage object detection; Edge-target interaction; Frequency-spatial fusion; Frequency decomposition; NETWORK; NET;
D O I
10.1016/j.neucom.2024.129064
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The realm of camouflage object detection endeavors to precisely discern concealed targets that blend seamlessly into their surroundings. Recently, numerous research endeavors have converged on unraveling the disguise tactics employed by such objects, striving for efficient and accurate detection results. While these methods have garnered notable advancements in unveiling camouflaged targets, they still grapple with challenges posed by the high similarity between objects and their backgrounds, resulting in misidentifications, detection oversights, and the loss of intricate details. In light of these challenges, this paper introduces an innovative EF Net, aimed at elevating the integrity of target recognition and the finesse of detail capture through deepened interactions between E dge-target features and F requency-spatial information. Notably, recognizing that the edges of camouflaged objects harbor a wealth of intrinsic details and serve as crucial delimiters distinguishing them from their surroundings, we have devised a dual-path architecture, where an auxiliary Edge Detection (ED) branch collaborates seamlessly with the primary Object Segmentation (OS) branch. Within the OS branch, we meticulously crafted the Edge-induced Segmentation Refinement module (ESR), which ingeniously harnesses the refined edge information provided by the ED branch to bolster target segmentation accuracy and augment detail fidelity. Moreover, in the supporting ED branch, we propose the Structure-induced Edge Enhancement module (SEE), designed to accentuate edge information while suppressing irrelevant noise with the assistance of the OS branch, thereby ensuring the precise extraction of edge information. To further enrich the information on dual-path, we innovatively introduce a Frequency Decomposition Module (FDM), which decomposes images into low- and high-frequency components, tailored to enhance target segmentation and edge detection, respectively. Additionally, to ensure seamless integration of frequency-domain and spatial- domain information, we devised the versatile Frequency-Spatial Domain Mixer (FSDM), achieving precise information alignment and fusion of these two domains. Quantitative and qualitative experimental results demonstrate that our proposed EFNet significantly outperforms existing state-of-the-art methods on four widely used benchmark datasets.
引用
收藏
页数:12
相关论文
共 52 条
  • [1] Surface Defect Detection Methods for Industrial Products: A Review
    Chen, Yajun
    Ding, Yuanyuan
    Zhao, Fan
    Zhang, Erhu
    Wu, Zhangnan
    Shao, Linhao
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (16):
  • [2] Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution
    Chen, Yunpeng
    Fan, Haoqi
    Xu, Bing
    Yan, Zhicheng
    Kalantidis, Yannis
    Rohrbach, Marcus
    Yan, Shuicheng
    Feng, Jiashi
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3434 - 3443
  • [3] Chen Zuyao, 2020, arXiv
  • [4] Structure-Measure: A New Way to Evaluate Foreground Maps
    Cheng, Ming-Ming
    Fan, Deng-Ping
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (09) : 2622 - 2638
  • [5] Frequency Perception Network for Camouflaged Object Detection
    Cong, Runmin
    Sun, Mengyao
    Zhang, Sanyi
    Zhou, Xiaofei
    Zhang, Wei
    Zhao, Yao
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1179 - 1189
  • [6] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [7] Deep Residual Learning in the JPEG Transform Domain
    Ehrlich, Max
    Davis, Larry
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3483 - 3492
  • [8] Camouflaged Object Detection
    Fan, Deng-Ping
    Ji, Ge-Peng
    Sun, Guolei
    Cheng, Ming-Ming
    Shen, Jianbing
    Shao, Ling
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2774 - 2784
  • [9] Fan DP, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P698
  • [10] IdeNet: Making Neural Network Identify Camouflaged Objects Like Creatures
    Guan, Juwei
    Fang, Xiaolin
    Zhu, Tongxin
    Cai, Zhipeng
    Ling, Zhen
    Yang, Ming
    Luo, Junzhou
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 4824 - 4839