Bridging Search Region Interaction with Template for RGB-T Tracking

Cited by: 44
Authors
Hui, Tianrui [1 ,2 ]
Xun, Zizheng [3 ,5 ]
Peng, Fengguang [3 ,5 ]
Huang, Junshi [4 ]
Wei, Xiaoming [4 ]
Wei, Xiaolin [4 ]
Dai, Jiao [1 ,2 ]
Han, Jizhong [1 ,2 ]
Liu, Si [3 ,5 ]
Affiliations
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
[3] Beihang Univ, Inst Artificial Intelligence, Beijing, Peoples R China
[4] Meituan, Beijing, Peoples R China
[5] Beihang Univ, Hangzhou Innovat Inst, Hangzhou, Peoples R China
Funding
National Natural Science Foundation of China;
DOI
10.1109/CVPR52729.2023.01310
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
RGB-T tracking aims to leverage the mutual enhancement and complementary ability of RGB and TIR modalities for improving the tracking process in various scenarios, where cross-modal interaction is the key component. Some previous methods concatenate the RGB and TIR search region features directly to perform a coarse interaction process, which introduces redundant background noise. Many other methods sample candidate boxes from search frames and conduct various fusion approaches on isolated pairs of RGB and TIR boxes, which limits the cross-modal interaction to local regions and leads to inadequate context modeling. To alleviate these limitations, we propose a novel Template-Bridged Search region Interaction (TBSI) module which exploits templates as the medium to bridge the cross-modal interaction between RGB and TIR search regions by gathering and distributing target-relevant object and environment contexts. Original templates are also updated with enriched multimodal contexts from the template medium. Our TBSI module is inserted into a ViT backbone for joint feature extraction, search-template matching, and cross-modal interaction. Extensive experiments on three popular RGB-T tracking benchmarks demonstrate that our method achieves new state-of-the-art performance. Code is available at https://github.com/RyanHTR/TBSI.
Pages: 13630 - 13639
Page count: 10
Related papers
  • [1] RGB-T Tracking With Template-Bridged Search Interaction and Target-Preserved Template Updating
    Li, Bo
    Peng, Fengguang
    Hui, Tianrui
    Wei, Xiaoming
    Wei, Xiaolin
    Zhang, Lijun
    Shi, Hang
    Liu, Si
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (01) : 634 - 649
  • [2] Attention interaction based RGB-T tracking method
    Wang W.
    Fu F.
    Lei H.
    Tang Z.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, 32 (03) : 435 - 444
  • [3] Region Selective Fusion Network for Robust RGB-T Tracking
    Yu, Zhencheng
    Fan, Huijie
    Wang, Qiang
    Li, Ziwan
    Tang, Yandong
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1357 - 1361
  • [4] Learning cross-modal interaction for RGB-T tracking
    Xu, Chunyan
    Cui, Zhen
    Wang, Chaoqun
    Zhou, Chuanwei
    Yang, Jian
    SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (01)
  • [5] Learning cross-modal interaction for RGB-T tracking
    Chunyan XU
    Zhen CUI
    Chaoqun WANG
    Chuanwei ZHOU
    Jian YANG
    Science China(Information Sciences), 2023, 66 (01) : 320 - 321
  • [6] Learning cross-modal interaction for RGB-T tracking
    Chunyan Xu
    Zhen Cui
    Chaoqun Wang
    Chuanwei Zhou
    Jian Yang
    Science China Information Sciences, 2023, 66
  • [7] Channel Exchanging for RGB-T Tracking
    Zhao, Long
    Zhu, Meng
    Ren, Honge
    Xue, Lingjixuan
    SENSORS, 2021, 21 (17)
  • [8] RGB-T object tracking: Benchmark and baseline
    Li, Chenglong
    Liang, Xinyan
    Lu, Yijuan
    Zhao, Nan
    Tang, Jin
    PATTERN RECOGNITION, 2019, 96
  • [9] Dynamic Tracking Aggregation with Transformers for RGB-T Tracking
    Liu, Xiaohu
    Lei, Zhiyong
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2023, 19 (01) : 80 - 88
  • [10] RGB-T tracking with frequency hybrid awareness
    Lei, Lei
    Li, Xianxian
    IMAGE AND VISION COMPUTING, 2024, 152