TPRNet: camouflaged object detection via transformer-induced progressive refinement network

被引:37
作者
Zhang, Qiao [1 ]
Ge, Yanliang [1 ]
Zhang, Cong [1 ]
Bi, Hongbo [1 ]
机构
[1] Northeast Petr Univ, Sch Elect Informat Engn, Daqing 163000, Peoples R China
关键词
Deep learning; Camouflaged object detection; Transformer; Progressive refinement;
D O I
10.1007/s00371-022-02611-1
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Camouflaged object detection (COD) is a challenging task which aims to detect objects similar to the surrounding environment. In this paper, we propose a transformer-induced progressive refinement network (TPRNet) to solve challenging COD tasks. Specifically, our network includes a Transformer-induced Progressive Refinement Module (TPRM) and a Semantic-Spatial Interaction Enhancement Module (SIEM). In TPRM, high-level features with rich semantic information are integrated through transformers as prior guidance, and then, it is sent to the refinement concurrency unit (RCU), and the accurately positioned feature area is obtained through a progressive refinement strategy. In SIEM, we perform feature interaction to localizedaccurate semantic features and low-level features to obtain rich fine-grained clues and increase the symbolic power of boundary features. Extensive experiments on four widely used benchmark datasets (i.e., CAMO, CHAMELEON, COD10K, and NC4K) demonstrate that our TPRNet is an effective COD model and outperforms state-of-the-art models. The code is available https://github.com/zhangyiao970914/TPRNet.
引用
收藏
页码:4593 / 4607
页数:15
相关论文
共 58 条
  • [1] Amit SNKB, 2016, INT GEOSCI REMOTE SE, P5189, DOI 10.1109/IGARSS.2016.7730352
  • [2] Ba J, 2016, ARXIV160706450
  • [3] Rethinking Camouflaged Object Detection: Models and Datasets
    Bi, Hongbo
    Zhang, Cong
    Wang, Kang
    Tong, Jinghui
    Zheng, Feng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 5708 - 5724
  • [4] Bi HB, 2021, VISUAL COMPUT, V37, P911, DOI 10.1007/s00371-020-01842-4
  • [5] You Only Look One-level Feature
    Chen, Qiang
    Wang, Yingming
    Yang, Tong
    Zhang, Xiangyu
    Cheng, Jian
    Sun, Jian
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13034 - 13043
  • [6] DG-Labeler and DGL-MOTS Dataset: Boost the Autonomous Driving Perception
    Cui, Yiming
    Cao, Zhiwen
    Xie, Yixin
    Jiang, Xingyu
    Tao, Feng
    Chen, Yingjie Victor
    Li, Lin
    Liu, Dongfang
    [J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 3411 - 3420
  • [7] Deng-Ping Fan, 2020, Medical Image Computing and Computer Assisted Intervention - MICCAI 2020. 23rd International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12266), P263, DOI 10.1007/978-3-030-59725-2_26
  • [8] Dong B., 2021, ARXIV PREPRINT ARXIV, V1
  • [9] Dosovitskiy A., 2020, ICLR, V20, DOI 10.48550/arXiv.2010.11929
  • [10] The kinetics of carbon monoxide reduction of magnetite concentrate particles through CFD modelling
    Fan, De-Qiu
    Elzohiery, Mohamed
    Mohassab, Yousef
    Sohn, H. Y.
    [J]. IRONMAKING & STEELMAKING, 2021, 48 (07) : 769 - 778