Detecting Camouflaged Objects via Multi-Stage Coarse-to-Fine Refinement

被引:2
作者
Wang, Yuye [1 ]
Chen, Tianyou [2 ]
Hu, Xiaoguang [3 ]
Shi, Jiaqi [3 ]
Jia, Zichong [3 ]
机构
[1] Minnan Normal Univ, Coll Phys & Informat Engn, Zhangzhou 363000, Fujian, Peoples R China
[2] Cent China Normal Univ, Fac Artificial Intelligence Educ, Wuhan 430079, Hubei, Peoples R China
[3] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
基金
中国国家自然科学基金;
关键词
Decoding; Feature extraction; Object detection; Object recognition; Semantics; Benchmark testing; Background noise; Convolutional neural networks; Image analysis; Camouflaged object detection; coarse-to-fine refinement; convolutional neural network; multi-stage detection; NETWORK; SEGMENTATION; FRAMEWORK; GUIDANCE; NET;
D O I
10.1109/ACCESS.2024.3380893
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Camouflaged objects are typically assimilated into their surroundings. Consequently, in contrast to generic object detection/segmentation, camouflaged object detection proves to be considerably more intricate due to the indistinct boundaries and heightened intrinsic similarities between foreground targets and the surrounding environment. Despite the proposition of numerous algorithms that have demonstrated commendable performance across various scenarios, these approaches may still grapple with blurred boundaries, leading to the inadvertent omission of camouflaged targets in challenging scenes. In this paper, we introduce a multi-stage framework tailored for segmenting camouflaged objects through a process of coarse-to-fine refinement. Specifically, our network encompasses three distinct decoders, each fulfilling a unique role in the model. In the initial decoder, we introduce the Bi-directional Locating Module to excavate foreground and background cues, enhancing target localization. The second decoder focuses on leveraging boundary information to augment overall performance, incorporating the Multi-level Feature Fusion Module to generate prediction maps with finer boundaries. Subsequently, the third decoder introduces the Mask-guided Fusion Module, designed to process high-resolution features under the guidance of the second decoder's results. This approach enables the preservation of structural details and the generation of fine-grained prediction maps. Through the integration of the three decoders, our model effectively identifies and segments camouflaged targets. Extensive experiments are conducted on three commonly used benchmark datasets. The results of these experiments demonstrate that, even without the application of pre-processing or post-processing techniques, our model outperforms 14 state-of-the-art algorithms.
引用
收藏
页码:44055 / 44068
页数:14
相关论文
共 65 条
[1]  
Chen J., 2022, Knowl.-Based Syst., V248
[2]  
Chen Q, 2021, AAAI CONF ARTIF INTE, V35, P1063
[3]   Reverse Attention-Based Residual Network for Salient Object Detection [J].
Chen, Shuhan ;
Tan, Xiuli ;
Wang, Ben ;
Lu, Huchuan ;
Hu, Xuelong ;
Fu, Yun .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :3763-3776
[4]   Adaptive fusion network for RGB-D salient object detection [J].
Chen, Tianyou ;
Xiao, Jin ;
Hu, Xiaoguang ;
Zhang, Guofeng ;
Wang, Shaojie .
NEUROCOMPUTING, 2023, 522 :152-164
[5]   BINet: Bidirectional interactive network for salient object detection [J].
Chen, Tianyou ;
Hu, Xiaoguang ;
Xiao, Jin ;
Zhang, Guofeng ;
Wang, Shaojie .
NEUROCOMPUTING, 2021, 465 :490-502
[6]   Camouflage Images [J].
Chu, Hung-Kuo ;
Hsu, Wei-Hsin ;
Mitra, Niloy J. ;
Cohen-Or, Daniel ;
Wong, Tien-Tsin ;
Lee, Tong-Yee .
ACM TRANSACTIONS ON GRAPHICS, 2010, 29 (04)
[7]   Ternary symmetric fusion network for camouflaged object detection [J].
Deng, Yangyang ;
Ma, Jianxin ;
Li, Yajun ;
Zhang, Min ;
Wang, Li .
APPLIED INTELLIGENCE, 2023, 53 (21) :25216-25231
[8]  
Deng-Ping Fan, 2020, Medical Image Computing and Computer Assisted Intervention - MICCAI 2020. 23rd International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12266), P263, DOI 10.1007/978-3-030-59725-2_26
[9]  
Dosovitskiy A., 2021, INT C LEARNING REPRE
[10]   Concealed Object Detection [J].
Fan, Deng-Ping ;
Ji, Ge-Peng ;
Cheng, Ming-Ming ;
Shao, Ling .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) :6024-6042