CSFIN: A lightweight network for camouflaged object detection via cross-stage feature interaction

被引:0
作者
Li, Minghong [1 ,2 ]
Zhao, Yuqian [1 ,2 ]
Zhang, Fan [1 ,2 ]
Gui, Gui [1 ,2 ]
Luo, Biao [1 ,2 ]
Yang, Chunhua [1 ,2 ]
Gui, Weihua [1 ,2 ]
Chang, Kan [3 ,4 ]
机构
[1] Cent South Univ, Sch Automat, Changsha 410083, Hunan, Peoples R China
[2] Cent South Univ, Key Lab Ind Intelligence & Syst, Minist Educ, Changsha 410083, Hunan, Peoples R China
[3] Guangxi Univ, Sch Comp & Elect Informat, Nanning 530004, Guangxi, Peoples R China
[4] Guangxi Univ, Guangxi Key Lab Multimedia Commun & Network Techno, Nanning 530004, Guangxi, Peoples R China
关键词
Deep learning; Convolutional neural network; Camouflaged object detection; Feature extraction;
D O I
10.1016/j.eswa.2025.126451
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Camouflaged object detection (COD) aims to identify the objects that are hidden in their surroundings, which is a very challenging task due to factors like complex contours and high similarity to the background. Existing COD methods often face the issue of neglecting the impact of feature transformation after the backbone, which leads to accuracy constraints in the subsequent decoding step. Additionally, most COD methods typically demand a greater number of parameters and higher computational complexity to achieve notable performance. To tackle the above issues, a lightweight cross-stage feature interaction network (CSFIN) is proposed in this paper. In the CSFIN, a cross-stage feature perception neck (CSFPN) is designed to achieve effective feature transformation, which improves each stage backbone feature with its adjacent stage information, thus obtaining more comprehensive feature representations for the final decoding. In CSFPN, a bi-directional feature interaction module (BFIM) is introduced between two adjacent stage features to deeply aggregate informative camouflaged cues. As the key component of the BFIM, a multi-scale cross-attention modulation block is constructed to fully investigate the multi-scale long-range dependency between two input features, which can effectively incorporate external guidance. Extensive experimental results illustrate that the CSFIN outperforms other COD models with fewer parameters and lower computational complexity.
引用
收藏
页数:14
相关论文
共 57 条
  • [1] An Automatic Graphic Pattern Generation Algorithm and Its Application to the Multipurpose Camouflage Pattern Design
    Abdi, Arash
    Safabakhsh, Reza
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (08) : 4748 - 4762
  • [2] [Anonymous], 1908, BIOMETRIKA, V6, P1
  • [3] Ba J.L., 2016, arXiv, DOI DOI 10.48550/ARXIV.1607.06450
  • [4] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [5] Camouflaged Object Detection via Context-Aware Cross-Level Fusion
    Chen, Geng
    Liu, Si-Jie
    Sun, Yu-Jia
    Ji, Ge-Peng
    Wu, Ya-Feng
    Zhou, Tao
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6981 - 6993
  • [6] Deng-Ping Fan, 2020, Medical Image Computing and Computer Assisted Intervention - MICCAI 2020. 23rd International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12266), P263, DOI 10.1007/978-3-030-59725-2_26
  • [7] Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
  • [8] El-Nouby A, 2021, Arxiv, DOI arXiv:2106.09681
  • [9] Fan DP, 2018, Arxiv, DOI arXiv:1805.10421
  • [10] Concealed Object Detection
    Fan, Deng-Ping
    Ji, Ge-Peng
    Cheng, Ming-Ming
    Shao, Ling
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 6024 - 6042