Boosting Camouflaged Object Detection with Dual-Task Interactive Transformer

Cited by: 38
Authors
Liu, Zhengyi [1]
Zhang, Zhili [1]
Tan, Yacheng [1]
Wu, Wei [1]
Affiliations
[1] Anhui Univ, Sch Comp Sci & Technol, Hefei, Anhui, Peoples R China
Source
2022 26th International Conference on Pattern Recognition (ICPR) | 2022
Keywords
camouflaged object detection; boundary detection; transformer; interactive; multi-task learning
DOI
10.1109/ICPR56361.2022.9956724
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Camouflaged object detection aims to discover concealed objects hidden in their surroundings. Existing methods follow a bio-inspired framework that first locates the object and then refines its boundary. We argue that discovering camouflaged objects depends on a recurrent search for both the object and the boundary. This recurrent processing tires and frustrates humans, but it is precisely the strength of the transformer, with its global search ability. Therefore, a dual-task interactive transformer is proposed to detect both the accurate position of the camouflaged object and its detailed boundary. The boundary feature is used as the Query to improve camouflaged object detection, while the object feature is used as the Query to improve boundary detection. The two tasks interact fully through multi-head self-attention. In addition, to obtain the initial object and boundary features, transformer-based backbones are adopted to extract the foreground and the background. The foreground is simply the object, while the foreground minus the background is treated as the boundary. Here, the boundary feature can be obtained from the blurry boundary region between the foreground and the background. Supervised by the object, background, and boundary ground truth, the proposed model achieves state-of-the-art performance on public datasets. Code: https://github.com/liuzywen/COD
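The two ideas in the abstract (boundary as foreground minus background, and each task's feature serving as the Query for the other) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes features are plain lists of token vectors, uses a single attention head with no learned projections, and the function names `initial_boundary`, `cross_attention`, and `dual_task_interaction` are hypothetical.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def initial_boundary(foreground, background):
    """Boundary feature as the element-wise difference foreground - background."""
    return [[f - b for f, b in zip(f_tok, b_tok)]
            for f_tok, b_tok in zip(foreground, background)]


def cross_attention(query, key, value):
    """Single-head scaled dot-product attention (assumption: no learned projections)."""
    d = len(query[0])
    out = []
    for q in query:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in key]
        weights = softmax(scores)
        # Each output token is a weighted average of the value tokens.
        out.append([sum(w * v[j] for w, v in zip(weights, value))
                    for j in range(len(value[0]))])
    return out


def dual_task_interaction(object_feat, boundary_feat):
    """Boundary queries the object feature; object queries the boundary feature."""
    refined_object = cross_attention(boundary_feat, object_feat, object_feat)
    refined_boundary = cross_attention(object_feat, boundary_feat, boundary_feat)
    return refined_object, refined_boundary


# Toy example: two tokens with two channels each (made-up values).
foreground = [[1.0, 2.0], [0.0, 1.0]]
background = [[0.5, 0.5], [0.0, 0.25]]
boundary = initial_boundary(foreground, background)
refined_object, refined_boundary = dual_task_interaction(foreground, boundary)
```

In a real model the Query, Key, and Value would pass through learned linear projections and several heads; the sketch only shows the direction of information flow between the two tasks.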
Pages: 140-146
Number of pages: 7