Feature Shrinkage Pyramid for Camouflaged Object Detection with Transformers

被引:90
|
作者
Huang, Zhou [1 ,2 ]
Dai, Hang [3 ]
Xiang, Tian-Zhu [4 ]
Wang, Shuo [5 ]
Chen, Huai-Xin [2 ]
Qin, Jie [6 ]
Xiong, Huan [7 ]
机构
[1] Sichuan Changhong Elect Co Ltd, Mianyang, Sichuan, Peoples R China
[2] UESTC, Chengdu, Peoples R China
[3] Univ Glasgow, Glasgow, Lanark, Scotland
[4] G42, Shanghai, Peoples R China
[5] Swiss Fed Inst Technol, Zurich, Switzerland
[6] NUAA, CCST, Nanjing, Peoples R China
[7] MBZUAI, Abu Dhabi, U Arab Emirates
关键词
D O I
10.1109/CVPR52729.2023.00538
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vision transformers have recently shown strong global context modeling capabilities in camouflaged object detection. However, they suffer from two major limitations: less effective locality modeling and insufficient feature aggregation in decoders, which are not conducive to camouflaged object detection that explores subtle cues from indistinguishable backgrounds. To address these issues, in this paper, we propose a novel transformer-based Feature Shrinkage Pyramid Network (FSPNet), which aims to hierarchically decode locality-enhanced neighboring transformer features through progressive shrinking for camouflaged object detection. Specifically, we propose a nonlocal token enhancement module (NL-TEM) that employs the non-local mechanism to interact neighboring tokens and explore graph-based high-order relations within tokens to enhance local representations of transformers. Moreover, we design a feature shrinkage decoder (FSD) with adjacent interaction modules (AIM), which progressively aggregates adjacent transformer features through a layer-by-layer shrinkage pyramid to accumulate imperceptible but effective cues as much as possible for object information decoding. Extensive quantitative and qualitative experiments demonstrate that the proposed model significantly outperforms the existing 24 competitors on three challenging COD benchmark datasets under six widely-used evaluation metrics. Our code is publicly available at https: //github.com/ZhouHuang23/FSPNet.
引用
收藏
页码:5557 / 5566
页数:10
相关论文
共 50 条
  • [1] Feature Shrinkage Pyramid for Camouflaged Object Detection with Transformers
    Huang, Zhou
    Dai, Hang
    Xiang, Tian-Zhu
    Wang, Shuo
    Chen, Huai-Xin
    Qin, Jie
    Xiong, Huan
    Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2023, 2023-June : 5557 - 5566
  • [2] FocusTR: Focusing on Valuable Feature by Multiple Transformers for Fusing Feature Pyramid on Object Detection
    Xie, Bangquan
    Yang, Liang
    Yang, Zongming
    Wei, Ailin
    Weng, Xiaoxiong
    Li, Bing
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 518 - 525
  • [3] ZoomNeXt: A Unified Collaborative Pyramid Network for Camouflaged Object Detection
    Pang, Youwei
    Zhao, Xiaoqi
    Xiang, Tian-Zhu
    Zhang, Lihe
    Lu, Huchuan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 9205 - 9220
  • [4] Centralized Feature Pyramid for Object Detection
    Quan, Yu
    Zhang, Dong
    Zhang, Liyan
    Tang, Jinhui
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4341 - 4354
  • [5] Feature Pyramid Networks for Object Detection
    Lin, Tsung-Yi
    Dollar, Piotr
    Girshick, Ross
    He, Kaiming
    Hariharan, Bharath
    Belongie, Serge
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 936 - 944
  • [6] Content-augmented feature pyramid network with light linear spatial transformers for object detection
    Gu, Yongxiang
    Qin, Xiaolin
    Peng, Yuncong
    Li, Lu
    IET IMAGE PROCESSING, 2022, 16 (13) : 3567 - 3578
  • [7] Camouflaged Object Detection with Feature Grafting and Distractor Aware
    Song, Yuxuan
    Li, Xinyue
    Qi, Lin
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2459 - 2464
  • [8] Feature Aggregation and Propagation Network for Camouflaged Object Detection
    Zhou, Tao
    Zhou, Yi
    Gong, Chen
    Yang, Jian
    Zhang, Yu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 7036 - 7047
  • [9] Camouflaged Object Detection with Feature Decomposition and Edge Reconstruction
    He, Chunming
    Li, Kai
    Zhang, Yachao
    Tang, Longxiang
    Zhang, Yulun
    Guo, Zhenhua
    Li, Xiu
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 22046 - 22055
  • [10] Camouflaged Object Detection with a Feature Lateral Connection Network
    Wang, Tao
    Wang, Jian
    Wang, Ruihao
    ELECTRONICS, 2023, 12 (12)