Multidimensional fusion of frequency and spatial domain information for enhanced camouflaged object detection

被引:2
|
作者
Wang, Tingran [1 ]
Yu, Zaiyang [2 ,4 ]
Fang, Jianwei [3 ]
Xie, Jinlong [2 ,4 ]
Yang, Feng [1 ]
Zhang, Huang [2 ]
Zhang, Liping [2 ]
Du, Minghua [5 ]
Li, Lusi [6 ]
Ning, Xin [2 ]
机构
[1] China Univ Min & Technol Beijing, Sch Artificial Intelligence, Beijing 100083, Peoples R China
[2] Chinese Acad Sci, Inst Semicond, AnnLab, Beijing 100085, Peoples R China
[3] China Unicom Software Res Inst, Beijing 100048, Peoples R China
[4] Univ Chinese Acad Sci, Coll Mat Sci & Optoelect Technol, Beijing 100049, Peoples R China
[5] Chinese Peoples Liberat Army Gen Hosp, Med Ctr 1, Dept Emergency, Beijing 100853, Peoples R China
[6] Old Dominion Univ, Dept Comp Sci, Norfolk, VA 23529 USA
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Camouflaged Object Detection; Spatial-frequency fusion domain; Fast Fourier Convolution; SALIENT; NETWORK;
D O I
10.1016/j.inffus.2024.102871
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Camouflaged Object Detection (COD) remains a challenging task in computer vision due to the difficulty of distinguishing highly similar targets from complex backgrounds. Existing COD methods often struggle with scene understanding and information utilization, resulting in insufficient accuracy, leading to issues such as background misactivations, target localization losses, and edge blurring. To address these challenges, we propose a novel approach called the Mask and Attention Modulated Information Focusing Network (MAMIFNet), designed to improve COD in the spatial-frequency fusion domain. MAMIFNet introduces a Mask and Attention Modulated Fast Fourier Convolution (MAM-FFC) operator, which adaptively enhances global and local information across spatial and frequency domains through cross-domain masking and attention mechanisms. This operator enables amore comprehensive scene understanding, reducing regional errors such as wide-range false activations and target area losses. To further optimize scene information utilization, the Critical Cue Perception Module (CCPM) is introduced and built upon the MAM-FFC operator. The CCPM refines target foreground focus and foreground-background contrast, mining discriminative cues to improve target localization and edge detection accuracy. Experimental evaluations on four benchmark datasets demonstrate the competitive performance of the proposed method, validating its effectiveness in COD. The code is available at the link below https://github.com/CHANTILLY2023/MAMIFNet.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Frequency-Spatial Entanglement Learning for Camouflaged Object Detection
    Sun, Yanguang
    Xu, Chunyan
    Yang, Jian
    Xuan, Hanyu
    Luo, Lei
    COMPUTER VISION - ECCV 2024, PT VI, 2025, 15064 : 343 - 360
  • [2] Frequency-Guided Spatial Adaptation for Camouflaged Object Detection
    Zhang, Shizhou
    Kong, Dexuan
    Xing, Yinghui
    Lu, Yue
    Ran, Lingyan
    Liang, Guoqiang
    Wang, Hexu
    Zhang, Yanning
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 72 - 83
  • [3] Detecting Camouflaged Object in Frequency Domain
    Zhong, Yijie
    Li, Bo
    Tang, Lv
    Kuang, Senyun
    Wu, Shuang
    Ding, Shouhong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4494 - 4503
  • [4] Fusion of Foreground Object, Spatial and Frequency Domain Motion Information for Video Summarization
    Salehin, Md. Musfequs
    Paul, Manoranjan
    IMAGE AND VIDEO TECHNOLOGY - PSIVT 2015 WORKSHOPS, 2016, 9555 : 319 - 331
  • [5] Edge Perception Camouflaged Object Detection Under Frequency Domain Reconstruction
    Liu, Zijian
    Deng, Xiaoheng
    Jiang, Ping
    Lv, Conghao
    Min, Geyong
    Wang, Xin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 10194 - 10207
  • [6] Promoting camouflaged object detection through novel edge-target interaction and frequency-spatial fusion
    Guan, Juwei
    Qian, Weiqi
    Zhu, Tongxin
    Fang, Xiaolin
    NEUROCOMPUTING, 2025, 617
  • [7] Spatial frequency enhanced salient object detection
    Li, Xiaofang
    Wang, Yi
    Wang, Tianzhu
    Wang, Ruili
    INFORMATION SCIENCES, 2023, 647
  • [8] Frequency Perception Network for Camouflaged Object Detection
    Cong, Runmin
    Sun, Mengyao
    Zhang, Sanyi
    Zhou, Xiaofei
    Zhang, Wei
    Zhao, Yao
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1179 - 1189
  • [9] Frequency Representation Integration for Camouflaged Object Detection
    Xie, Chenxi
    Xia, Changqun
    Yu, Tianshu
    Li, Jia
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1789 - 1797
  • [10] Frequency-aware Camouflaged Object Detection
    Lin, Jiaying
    Tan, Xin
    Xu, Ke
    Ma, Lizhuang
    Lau, Rynsonw. H.
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)