Visual and Language Collaborative Learning for RGBT Object Tracking

Cited by: 0

Authors:

Wang, Jiahao [1,2]

Liu, Fang [1,2]

Jiao, Licheng [1,2]

Gao, Yingjia [1,2]

Wang, Hao [1,2]

Li, Shuo [1,2]

Li, Lingling [1,2]

Chen, Puhua [1,2]

Liu, Xu [1,2]

Affiliations:

[1] Xidian Univ, Int Res Ctr Intelligent Percept & Computat, Minist Educ, Key Lab Intelligent Percept & Image Understanding, Xian 710071, Shaanxi, Peoples R China

[2] Xidian Univ, Sch Artificial Intelligence, Joint Int Res Lab Intelligent Percept & Computat, Xian 710071, Shaanxi, Peoples R China

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024, Vol. 34, No. 12

Funding:

China Postdoctoral Science Foundation; National Natural Science Foundation of China

Keywords:

Target tracking; Feature extraction; Visualization; Object tracking; Task analysis; Lighting; Circuits and systems; RGBT object tracking; complementary features; target label information; prompt learning; prior boxes and language; T TRACKING; NETWORK;

DOI:

10.1109/TCSVT.2024.3436878

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification codes:

0808; 0809

Abstract:

Despite the extensive research on RGBT object tracking, there are still several challenges and issues in practical applications, such as modality differences, lighting variations and disappearance of the target, and changes in viewpoint. Existing methods mostly address these issues by fusing image features, while neglecting a significant amount of target label information. To address these challenges, this paper introduces text to drive the alignment of visible and infrared image features, transforming features from different modalities into the same feature space and fully using complementary features between different modalities. Furthermore, inspired by the success of prompt learning in various tasks, we utilize prior boxes and language as prompts to further guide the model in tracking the target. Extensive experiments demonstrate that the proposed VLCTrack tracker has excellent potential in RGBT object tracking. Compared to previous methods developed for this purpose, our approach achieves state-of-the-art performance on three benchmark datasets.

Pages: 12770-12781

Page count: 12
