A three-stage model for camouflaged object detection

被引:5
作者
Chen, Tianyou [1 ]
Ruan, Hui [2 ]
Wang, Shaojie [2 ]
Xiao, Jin [3 ]
Hu, Xiaoguang [3 ]
机构
[1] Chinese Aeronaut Radio Elect Res Inst, Shanghai 200241, Peoples R China
[2] Minist Ind & Informat Technol, Elect Res Inst 5, Guangzhou 511370, Peoples R China
[3] Beihang Univ, Beijing 100191, Peoples R China
关键词
Camouflaged object detection; Coarse-to-fine refinement; Convolutional neural network; Multi-stage detection; NETWORK; FRAMEWORK; NET;
D O I
10.1016/j.neucom.2024.128784
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Camouflaged objects are typically assimilated into their backgrounds and exhibit fuzzy boundaries. The complex environmental conditions and the high intrinsic similarity between camouflaged targets and their surroundings pose significant challenges inaccurately locating and segmenting these objects in their entirety. While existing methods have demonstrated remarkable performance in various real-world scenarios, they still face limitations when confronted with difficult cases, such as small targets, thin structures, and indistinct boundaries. Drawing inspiration from human visual perception when observing images containing camouflaged objects, we propose a three-stage model that enables coarse-to-fine segmentation in a single iteration. Specifically, our model employs three decoders to sequentially process subsampled features, cropped features, and high-resolution original features. This proposed approach not only reduces computational overhead but also mitigates interference caused by background noise. Furthermore, considering the significance of multi- scale information, we have designed a multi-scale feature enhancement module that enlarges the receptive field while preserving detailed structural cues. Additionally, a boundary enhancement module has been developed to enhance performance by leveraging boundary information. Subsequently, a mask-guided fusion module is proposed to generate fine-grained results by integrating coarse prediction maps with high-resolution feature maps. Our network shows superior performance without introducing unnecessary complexities. Upon acceptance of the paper, the source code will be made publicly available at https://github.com/clelouch/TSNet.
引用
收藏
页数:17
相关论文
共 86 条
[1]   Quality-Aware Selective Fusion Network for V-D-T Salient Object Detection [J].
Bao, Liuxin ;
Zhou, Xiaofei ;
Lu, Xiankai ;
Sun, Yaoqi ;
Yin, Haibing ;
Hu, Zhenghui ;
Zhang, Jiyong ;
Yan, Chenggang .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 :3212-3226
[2]   Adaptive fusion network for RGB-D salient object detection [J].
Chen, Tianyou ;
Xiao, Jin ;
Hu, Xiaoguang ;
Zhang, Guofeng ;
Wang, Shaojie .
NEUROCOMPUTING, 2023, 522 :152-164
[3]   Boundary-guided network for camouflaged object detection [J].
Chen, Tianyou ;
Xiao, Jin ;
Hu, Xiaoguang ;
Zhang, Guofeng ;
Wang, Shaojie .
KNOWLEDGE-BASED SYSTEMS, 2022, 248
[4]   BINet: Bidirectional interactive network for salient object detection [J].
Chen, Tianyou ;
Hu, Xiaoguang ;
Xiao, Jin ;
Zhang, Guofeng ;
Wang, Shaojie .
NEUROCOMPUTING, 2021, 465 :490-502
[5]   A rendezvous point-based data gathering in underwater wireless sensor networks for monitoring applications [J].
Choudhary, Monika ;
Goyal, Nitin .
INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2022, 35 (06)
[6]  
Deng-Ping Fan, 2020, Medical Image Computing and Computer Assisted Intervention - MICCAI 2020. 23rd International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12266), P263, DOI 10.1007/978-3-030-59725-2_26
[7]  
Dosovitskiy A., 2021, INT C LEARNING REPRE
[8]  
Fan D., 2021, Sci. Sin. Inf.
[9]   Concealed Object Detection [J].
Fan, Deng-Ping ;
Ji, Ge-Peng ;
Cheng, Ming-Ming ;
Shao, Ling .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) :6024-6042
[10]   Structure-measure: A New Way to Evaluate Foreground Maps [J].
Fan, Deng-Ping ;
Cheng, Ming-Ming ;
Liu, Yun ;
Li, Tao ;
Borji, Ali .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4558-4567