Progressively Aggregating Multi-Scale Scene Context Features for Camouflaged Object Detection

被引:0
作者
Liu Y. [1 ]
Zhang K.-H. [2 ]
Fan J.-Q. [3 ]
Zhao Y.-Q. [4 ]
Liu Q.-S. [2 ]
机构
[1] School of Automation, Nanjing University of Information Science and Technology, Nanjing
[2] School of Computer and Software, Engineering Research Center of Digital Forensics, Ministry of Education, Nanjing University of Information Science and Technology, Nanjing
[3] College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing
[4] Inspur Suzhou Intelligent Technology Co., Ltd, Suzhou
来源
Jisuanji Xuebao/Chinese Journal of Computers | 2022年 / 45卷 / 12期
关键词
Attention mechanism; Camouflaged object detection; Deep learning; Scene context;
D O I
10.11897/SP.J.1016.2022.02637
中图分类号
学科分类号
摘要
Camouflaged object detection (COD) is a computer vision task that imitates human visual mechanisms to recognize and locate camouflaged objects in complex scenes. However, the current COD methods cannot accurately discriminate the camouflage objects only by the local appearance features of the objects when meeting distractors with similar appearances. To this end, this paper presents a COD network based on progressively aggregating multi-scale scene context features, so that the accurate camouflaged object discrimination is realized by aggregating multi-stage semantic enhanced scene context features. Specifically, the network mainly has two novel designs: U-shape Context-Aware Module(UCAM) and Cross-level Feature Aggregation Module(CFAM). The UCAM aims to sense rich local to global context information such as detailed boundaries, texture features, and color changes of camouflaged objects. The CFAM combines the coordinate direction attention and the multi-level residual progressive feature aggregation mechanism to gradually aggregate complementary features between adjacent levels, strengthen the global semantics of camouflage objects and supplement local details. Extensive evaluations on 4 extremely challenging benchmarks including CHAMELEON, CAMO-Test, COD10K-Test, and NC4K, the experimental results have demonstrated that our model has achieved leading performance compared with state-of-the-art methods. © 2022, Science Press. All right reserved.
引用
收藏
页码:2637 / 2651
页数:14
相关论文
共 41 条
[1]  
Fan D P, Ji G P, Sun G, Et al., Camouflaged object detection, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2777-2787, (2020)
[2]  
Fan D P, Ji G P, Zhou T, Et al., Pranet: Parallel reverse attention network for polyp segmentation, Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 263-273, (2020)
[3]  
Fan D P, Zhou T, Ji G P, Et al., Inf-net: Automatic covid-19 lung infection segmentation from ct images, IEEE Transactions on Medical Imaging, 39, 8, pp. 2626-2637, (2020)
[4]  
Qiu L, Wu X, Yu Z., A high-efficiency fully convolutional networks for pixel-wise surface defect detection, IEEE Access, 7, pp. 15884-15893, (2019)
[5]  
Li T, Song H, Zhang K, Et al., Recurrent reverse attention guided residual learning for saliency object detection, Neurocomputing, 389, pp. 170-178, (2020)
[6]  
Liu N, Han J, Yang M H., Picanet: Learning pixel-wise contextual attention for saliency detection, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3089-3098, (2018)
[7]  
Qin X, Zhang Z, Huang C, Et al., Basnet: Boundary-aware salient object detection, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7479-7489, (2019)
[8]  
Wu Z, Su L, Huang Q., Cascaded partial decoder for fast and accurate salient object detection, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3907-3916, (2019)
[9]  
Zhao J X, Liu J J, Fan D P, Et al., EGNet: Edge guidance network for salient object detection, Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 8779-8788, (2019)
[10]  
Wang Zheng-Wen, Song Hui-Hui, Fan Jia-Qing, Liu Qing-Shan, Semantic guided feature aggregation network for salient object detection, Acta Automatica Sinica