RGBT tracking: A comprehensive review

被引:8
作者
Feng, Mingzheng
Su, Jianbo [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
RGBT tracking; Deep learning; Image fusion; FUSION TRACKING; MULTIMODAL FUSION; INFRARED IMAGES; T TRACKING; NETWORK; ROBUST; REGISTRATION; SIMILARITY; FRAMEWORK; ADAPTER;
D O I
10.1016/j.inffus.2024.102492
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, visual object tracking, as a prominent research area in computer vision, has garnered significant attention. To bolster the robustness of trackers across a spectrum of complex scenarios, researchers actively explore the synergistic potential of visible and thermal infrared images, aiming to design more potent tracking systems. This paper presents a comprehensive review of target tracking technology based on visible and thermal infrared information, encompassing three key aspects. Firstly, we categorize existing RGBT tracking methods into two main categories: traditional -based methods and deep learning -based methods. This classification facilitates a systematic understanding and comparison of the strengths and weaknesses of different approaches, providing a solid foundation for future research. Secondly, we focus on the evolution of RGBT datasets and analyze the performance of diverse tracking methods on these datasets. Research in this domain aids in evaluating the applicability of existing methods in real -world scenarios and offers guidance for future dataset construction. Finally, we delve into future research directions from multiple perspectives, including model design and dataset construction. In terms of model design, researchers are encouraged to explore more efficient feature extraction methods and innovative model fusion structures to further enhance tracker performance. Regarding dataset construction, increased attention should be given to ensure diversity in real -world scenarios, guaranteeing optimal tracker performance across a variety of complex conditions. In conclusion, this review makes a comprehensive analysis of the development of RGBT tracking from different perspectives, and provides a valuable reference for researchers in related fields such as multi -modal tracking and image fusion. By systematically classifying and analyzing existing research while outlining future research prospects, this review aims to foster the continued development of this field and inspire the emergence of more innovative work.
引用
收藏
页数:23
相关论文
共 168 条
[1]   Fully-Convolutional Siamese Networks for Object Tracking [J].
Bertinetto, Luca ;
Valmadre, Jack ;
Henriques, Joao F. ;
Vedaldi, Andrea ;
Torr, Philip H. S. .
COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 :850-865
[2]   Learning Discriminative Model Prediction for Tracking [J].
Bhat, Goutam ;
Danelljan, Martin ;
Van Gool, Luc ;
Timofte, Radu .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6181-6190
[3]   Thermal-visible registration of human silhouettes: A similarity measure performance evaluation [J].
Bilodeau, Guillaume-Alexandre ;
Torabi, Atousa ;
St-Charles, Pierre-Luc ;
Riahi, Dorra .
INFRARED PHYSICS & TECHNOLOGY, 2014, 64 :79-86
[4]  
Bolme DS, 2010, PROC CVPR IEEE, P2544, DOI 10.1109/CVPR.2010.5539960
[5]   Learning modality feature fusion via transformer for RGBT-tracking [J].
Cai, Yujue ;
Sui, Xiubao ;
Gu, Guohua ;
Chen, Qian .
INFRARED PHYSICS & TECHNOLOGY, 2023, 133
[6]   Multi-modal multi-task feature fusion for RGBT tracking [J].
Cai, Yujue ;
Sui, Xiubao ;
Gu, Guohua .
INFORMATION FUSION, 2023, 97
[7]  
Cao B, 2024, AAAI CONF ARTIF INTE, P927
[8]   Transformer Tracking [J].
Chen, Xin ;
Yan, Bin ;
Zhu, Jiawen ;
Wang, Dong ;
Yang, Xiaoyun ;
Lu, Huchuan .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :8122-8131
[9]   Siamese Box Adaptive Network for Visual Tracking [J].
Chen, Zedu ;
Zhong, Bineng ;
Li, Guorong ;
Zhang, Shengping ;
Ji, Rongrong .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :6667-6676
[10]   Fusion Tree Network for RGBT Tracking [J].
Cheng, Zhiyuan ;
Lu, Andong ;
Zhang, Zhang ;
Li, Chenglong ;
Wang, Liang .
2022 18TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2022), 2022,