Promoting camouflaged object detection through novel edge-target interaction and frequency-spatial fusion

被引:0
作者
Guan, Juwei [1 ]
Qian, Weiqi [1 ]
Zhu, Tongxin [1 ]
Fang, Xiaolin [1 ,2 ]
机构
[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing 211189, Peoples R China
[2] Southeast Univ, Key Lab Comp Network & Informat Integrat, Minist Educ, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Camouflage object detection; Edge-target interaction; Frequency-spatial fusion; Frequency decomposition; NETWORK; NET;
D O I
10.1016/j.neucom.2024.129064
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The realm of camouflage object detection endeavors to precisely discern concealed targets that blend seamlessly into their surroundings. Recently, numerous research endeavors have converged on unraveling the disguise tactics employed by such objects, striving for efficient and accurate detection results. While these methods have garnered notable advancements in unveiling camouflaged targets, they still grapple with challenges posed by the high similarity between objects and their backgrounds, resulting in misidentifications, detection oversights, and the loss of intricate details. In light of these challenges, this paper introduces an innovative EF Net, aimed at elevating the integrity of target recognition and the finesse of detail capture through deepened interactions between E dge-target features and F requency-spatial information. Notably, recognizing that the edges of camouflaged objects harbor a wealth of intrinsic details and serve as crucial delimiters distinguishing them from their surroundings, we have devised a dual-path architecture, where an auxiliary Edge Detection (ED) branch collaborates seamlessly with the primary Object Segmentation (OS) branch. Within the OS branch, we meticulously crafted the Edge-induced Segmentation Refinement module (ESR), which ingeniously harnesses the refined edge information provided by the ED branch to bolster target segmentation accuracy and augment detail fidelity. Moreover, in the supporting ED branch, we propose the Structure-induced Edge Enhancement module (SEE), designed to accentuate edge information while suppressing irrelevant noise with the assistance of the OS branch, thereby ensuring the precise extraction of edge information. To further enrich the information on dual-path, we innovatively introduce a Frequency Decomposition Module (FDM), which decomposes images into low- and high-frequency components, tailored to enhance target segmentation and edge detection, respectively. Additionally, to ensure seamless integration of frequency-domain and spatial- domain information, we devised the versatile Frequency-Spatial Domain Mixer (FSDM), achieving precise information alignment and fusion of these two domains. Quantitative and qualitative experimental results demonstrate that our proposed EFNet significantly outperforms existing state-of-the-art methods on four widely used benchmark datasets.
引用
收藏
页数:12
相关论文
共 52 条
  • [51] Zhu HW, 2022, AAAI CONF ARTIF INTE, P3608
  • [52] DeepCrack: Learning Hierarchical Convolutional Features for Crack Detection
    Zou, Qin
    Zhang, Zheng
    Li, Qingquan
    Qi, Xianbiao
    Wang, Qian
    Wang, Song
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (03) : 1498 - 1512