TPRNet: camouflaged object detection via transformer-induced progressive refinement network

被引：54

作者：

Zhang, Qiao ^{[1
]}

Ge, Yanliang ^{[1
]}

Zhang, Cong ^{[1
]}

Bi, Hongbo ^{[1
]}

机构：

[1] Northeast Petr Univ, Sch Elect Informat Engn, Daqing 163000, Peoples R China

来源：

VISUAL COMPUTER | 2023年 / 39卷 / 10期

关键词：

Deep learning; Camouflaged object detection; Transformer; Progressive refinement;

D O I：

10.1007/s00371-022-02611-1

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Camouflaged object detection (COD) is a challenging task which aims to detect objects similar to the surrounding environment. In this paper, we propose a transformer-induced progressive refinement network (TPRNet) to solve challenging COD tasks. Specifically, our network includes a Transformer-induced Progressive Refinement Module (TPRM) and a Semantic-Spatial Interaction Enhancement Module (SIEM). In TPRM, high-level features with rich semantic information are integrated through transformers as prior guidance, and then, it is sent to the refinement concurrency unit (RCU), and the accurately positioned feature area is obtained through a progressive refinement strategy. In SIEM, we perform feature interaction to localizedaccurate semantic features and low-level features to obtain rich fine-grained clues and increase the symbolic power of boundary features. Extensive experiments on four widely used benchmark datasets (i.e., CAMO, CHAMELEON, COD10K, and NC4K) demonstrate that our TPRNet is an effective COD model and outperforms state-of-the-art models. The code is available https://github.com/zhangyiao970914/TPRNet.

引用

页码：4593 / 4607

页数：15

共 58 条

[1]

Amit SNKB, 2016, INT GEOSCI REMOTE SE, P5189, DOI 10.1109/IGARSS.2016.7730352

[2]

Ba J. L., 2016, CoRR

[3] Rethinking Camouflaged Object Detection: Models and Datasets [J].

Bi, Hongbo ;

Zhang, Cong ;

Wang, Kang ;

Tong, Jinghui ;

Zheng, Feng .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) :5708-5724

[4]

Bi HB, 2021, VISUAL COMPUT, V37, P911, DOI 10.1007/s00371-020-01842-4

[5] You Only Look One-level Feature [J].

Chen, Qiang ;

Wang, Yingming ;

Yang, Tong ;

Zhang, Xiangyu ;

Cheng, Jian ;

Sun, Jian .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13034-13043

[6] DG-Labeler and DGL-MOTS Dataset: Boost the Autonomous Driving Perception [J].

Cui, Yiming ;

Cao, Zhiwen ;

Xie, Yixin ;

Jiang, Xingyu ;

Tao, Feng ;

Chen, Yingjie Victor ;

Li, Lin ;

Liu, Dongfang .

2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, :3411-3420

[7]

Deng-Ping Fan, 2020, Medical Image Computing and Computer Assisted Intervention - MICCAI 2020. 23rd International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12266), P263, DOI 10.1007/978-3-030-59725-2_26

[8]

Dong B., 2021, ARXIV PREPRINT ARXIV, V1

[9]

Dosovitskiy A., 2021, arXiv

[10] The kinetics of carbon monoxide reduction of magnetite concentrate particles through CFD modelling [J].

Fan, De-Qiu ;

Elzohiery, Mohamed ;

Mohassab, Yousef ;

Sohn, H. Y. .

IRONMAKING & STEELMAKING, 2021, 48 (07) :769-778

← 1 2 3 4 5 6 →