Visual and Language Collaborative Learning for RGBT Object Tracking

Cited by: 5
Authors
Wang, Jiahao [1, 2]
Liu, Fang [1, 2]
Jiao, Licheng [1, 2]
Gao, Yingjia [1, 2]
Wang, Hao [1, 2]
Li, Shuo [1, 2]
Li, Lingling [1, 2]
Chen, Puhua [1, 2]
Liu, Xu [1, 2]
Affiliations
[1] Xidian Univ, Int Res Ctr Intelligent Percept & Computat, Minist Educ, Key Lab Intelligent Percept & Image Understanding, Xian 710071, Shaanxi, Peoples R China
[2] Xidian Univ, Sch Artificial Intelligence, Joint Int Res Lab Intelligent Percept & Computat, Xian 710071, Shaanxi, Peoples R China
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China
Keywords
Target tracking; Feature extraction; Visualization; Object tracking; Task analysis; Lighting; Circuits and systems; RGBT object tracking; complementary features; target label information; prompt learning; prior boxes and language; T TRACKING; NETWORK;
DOI
10.1109/TCSVT.2024.3436878
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification codes
0808; 0809
Abstract
Despite the extensive research on RGBT object tracking, there are still several challenges and issues in practical applications, such as modality differences, lighting variations and disappearance of the target, and changes in viewpoint. Existing methods mostly address these issues by fusing image features, while neglecting a significant amount of target label information. To address these challenges, this paper introduces text to drive the alignment of visible and infrared image features, transforming features from different modalities into the same feature space and fully using complementary features between different modalities. Furthermore, inspired by the success of prompt learning in various tasks, we utilize prior boxes and language as prompts to further guide the model in tracking the target. Extensive experiments demonstrate that the proposed VLCTrack tracker has excellent potential in RGBT object tracking. Compared to previous methods developed for this purpose, our approach achieves state-of-the-art performance on three benchmark datasets.
Pages: 12770-12781
Page count: 12