Improved SwinTrack single target tracking algorithm based on spatio-temporal feature fusion

Cited by: 2
Authors
Zhao, Min [1, 2]
Yue, Qiang [1]
Sun, Dihua [1]
Zhong, Yuan [1]
Affiliations
[1] Chongqing Univ, Sch Automat, Chongqing, Peoples R China
[2] Chongqing Univ, Sch Automat, Chongqing 400044, Peoples R China
Keywords
computer vision; feature extraction; image processing; object tracking
DOI
10.1049/ipr2.12803
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Single-target tracking based on computer vision helps to collect, analyse and exploit target information. SwinTrack has received widespread attention as one of the Siamese (twin) network trackers with the best trade-off between tracking accuracy and speed. However, it has two weaknesses: insufficient fusion of deep and shallow features, which loses shallow information, and insufficient use of temporal information, which leads to inconsistency between the target and the template. To fuse features effectively in space, a multi-level feature fusion strategy is proposed that combines semantic information with detailed information and introduces multiple convolutional forms. In addition, based on the idea of feedback, a dynamic template branch is designed to fuse temporal features and enhance the representation of target features. The effectiveness of the method was verified on the OTB100 and GOT10K datasets.
Pages: 2410-2421
Page count: 12