UBPT: Unidirectional and Bidirectional Prompts for RGBD Tracking

被引:0
作者
Ou, Zhou [1 ]
Zhang, Dawei [1 ,2 ]
Ying, Ge [1 ]
Zheng, Zhonglong [1 ,3 ]
机构
[1] Zhejiang Normal Univ, Sch Comp Sci & Technol, Jinhua 321004, Peoples R China
[2] Anhui Univ, Anhui Prov Key Lab Multimodal Cognit Computat, Hefei 230601, Peoples R China
[3] Zhejiang Normal Univ, Zhejiang Key Lab Intelligent Educ Technologyand Ap, Jinhua 321004, Peoples R China
基金
中国国家自然科学基金;
关键词
Multimodal fusion; object tracking; prompt tuning; RGBD tracking; FUSION NETWORK;
D O I
10.1109/JSEN.2024.3470852
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
RGBD tracking addresses the limitations of single RGB modality tracking in complex environments by incorporating depth images. In recent years, the widespread use of depth sensors has made RGBD data available, greatly promoting the development of RGBD tracking. However, existing RGBD datasets still suffer from small scale and scarcity, as well as the dominance of modalities in the tracking process often fluctuates with changes in the open environments. Most existing RGBD trackers struggle to adaptively extract information from both modalities, hindering the models' capacity to leverage the combined strengths of RGB and depth data. To address this issue, we propose a novel prompt-based RGBD tracking framework named unidirectional and bidirectional prompts for RGBD tracking (UBPT). UBPT employs a dual-stream transformer architecture and implements both unidirectional and bidirectional prompt learning to extract RGB and depth modality information, fully achieving cross-modality feature interaction. It can dynamically perform unidirectional intramodality feature enhancement and bidirectional intermodality feature fusion. This approach can dynamically and adaptively enhance dominant features while suppressing nondominant features. Experimental findings reveal that our proposed tracker achieves advanced results compared to recent state-of-the-art methods on various RGBD tracking benchmarks, demonstrating its superior tracking performance.
引用
收藏
页码:37503 / 37513
页数:11
相关论文
共 44 条
  • [1] Learning Discriminative Model Prediction for Tracking
    Bhat, Goutam
    Danelljan, Martin
    Van Gool, Luc
    Timofte, Radu
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6181 - 6190
  • [2] 3D Part-Based Sparse Tracker with Automatic Synchronization and Registration
    Bibi, Adel
    Zhang, Tianzhu
    Ghanem, Bernard
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1439 - 1448
  • [3] Camplani M., 2015, Quantum Safe Cryptography and Security: An introduction, benefits, enablers, and challenges, V3, P01
  • [4] Transformer Tracking
    Chen, Xin
    Yan, Bin
    Zhu, Jiawen
    Wang, Dong
    Yang, Xiaoyun
    Lu, Huchuan
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8122 - 8131
  • [5] MixFormer: End-to-End Tracking with Iterative Mixed Attention
    Cui, Yutao
    Jiang, Cheng
    Wang, Limin
    Wu, Gangshan
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13598 - 13608
  • [6] Dai HX, 2024, Arxiv, DOI [arXiv:2307.01187, DOI 10.48550/ARXIV.2412.13431]
  • [7] ATOM: Accurate Tracking by Overlap Maximization
    Danelljan, Martin
    Bhat, Goutam
    Khan, Fahad Shahbaz
    Felsberg, Michael
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4655 - 4664
  • [8] Lock-in Time-of-Flight (ToF) Cameras: A Survey
    Foix, Sergi
    Alenya, Guillem
    Torras, Carme
    [J]. IEEE SENSORS JOURNAL, 2011, 11 (09) : 1917 - 1926
  • [9] SiamCAR: Siamese Fully Convolutional Classification and Regression for Visual Tracking
    Guo, Dongyan
    Wang, Jun
    Cui, Ying
    Wang, Zhenhua
    Chen, Shengyong
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6268 - 6276
  • [10] Han Zhang, 2018, 2018 Chinese Control And Decision Conference (CCDC), P4856, DOI 10.1109/CCDC.2018.8407972