Depth alignment interaction network for camouflaged object detection

被引:0
作者
Hongbo Bi
Yuyu Tong
Jiayuan Zhang
Cong Zhang
Jinghui Tong
Wei Jin
机构
[1] Northeast Petroleum University,Sanya Offshore Oil and Gas Research Institute
[2] National University of Defense Technology,College of Electronic Countermeasures
[3] Northeast Petroleum University,School of Electrical Information Engineering
[4] Guizhou University,State Key Laboratory of Public Big Data
来源
Multimedia Systems | 2024年 / 30卷
关键词
Camouflaged object detection; Depth alignment index; Expanded pyramid interaction; Deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
Many animals actively change their own characteristics, such as color and texture, through camouflage, a natural defense mechanism, making them difficult to be detected in the natural environment, which makes the task of camouflaged object detection extremely challenging. Biological research shows that the eyes of animals have three-dimensional perception ability, and the obtained depth information can provide useful object positioning clues for finding camouflaged objects. However, almost all the current studies for camouflaged object detection do not combine depth maps with RGB images. Therefore, combining depth maps with traditional unimodal RGB images is of great research significance to improve the accuracy of camouflaged object detection. In this paper, we propose a depth alignment interaction network for camouflaged object detection in which the depth maps used are generated from existing monocular depth estimation networks. To address the problem that the quality of the generated depth maps varies, we propose a depth alignment index method to evaluate the quality of the depth maps. The method dynamically assigns the proportion of depth maps in the fusion process to depth maps of different quality according to their alignment with RGB images. Then, to fully extract the fused artifact features, we design an expanded pyramid interaction module, which first expands the receptive field of the features in each layer. Then, the features at the higher levels interacted with the features at the lower levels by connecting them step-by-step to further refine the predicted camouflaged area. Extensive experiments on 4 camouflaged object detection datasets demonstrate the effectiveness of our solution for camouflaged object detection.
引用
收藏
相关论文
共 104 条
[1]  
Han J(2017)Cnns-based rgb-d saliency detection via cross-view transfer and multiview fusion IEEE Trans. Cybern. 48 3171-3183
[2]  
Chen H(2019)Disruptive coloration and binocular disparity: breaking camouflage Proc. R. Soc. B 286 20182045-563
[3]  
Liu N(2015)Three-dimensional camouflage: exploiting photons to conceal form Am. Nat. 186 553-1637
[4]  
Yan C(2020)Towards robust monocular depth estimation: mixing datasets for zero-shot cross-dataset transfer IEEE Trans. Pattern Anal. Mach. Intell. 44 1623-3930
[5]  
Li X(2021)Concealed object detection IEEE Trans. Pattern Anal. Mach. Intell. 27 3918-1895
[6]  
Adams WJ(2018)A fusion framework for camouflaged moving foreground detection in the wavelet domain IEEE Trans. Image Process. 9 1883-4082
[7]  
Graf EW(2018)Quantifying camouflage and conspicuousness using visual salience Methods Ecol. Evol. 75 4065-33
[8]  
Anderson M(2016)Camouflage performance analysis and evaluation framework based on features fusion Multimed. Tools Appl. 26 29-108
[9]  
Penacchio O(2018)Detection of people with camouflage pattern via dense deconvolution network IEEE Signal Process. Lett. 20 92-1119
[10]  
Lovell PG(2023)Deep gradient learning for efficient camouflaged object detection Mach. Intell. Res. 31 1107-10269