An Improved Siamese Tracking Network Based On Self-Attention And Cross-Attention

被引:0
作者
Lai Yijun [1 ]
Song Jianmei [1 ]
She Haoping [1 ]
机构
[1] Beijing Inst Technol, Sch Aerosp Engn, Beijing, Peoples R China
来源
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC | 2023年
关键词
object tracking; Siamese network; self-attention; cross-attention;
D O I
10.1109/CCDC58219.2023.10326870
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Siamese visual tracking network SiamRPN++ is found that its success rate and robustness is unsatisfactory when meeting complex scenes such as occlusion, large deformation, interference of similar objects and long-time tracking. Refer to these, we propose an improvement strategy based on self-attention and cross-attention mechanism. For backbone, we use Channel and Space self-attention modules, and we using different cross channel attention modules between template features and search features in every three RPN modules, finally using special self-attention on similarity feature maps. These tricks effectively suppress interference, improve the features' quality and make progress in robustness. Comparing with original SiamRPN++ with parameters from official open-source frame, PySOT, our network improves robustness of 3% on VOT2018, accuracy of 2% and success rate of 3% on OTB100.
引用
收藏
页码:466 / 470
页数:5
相关论文
共 25 条
  • [1] DensSiam: End-to-End Densely-Siamese Network with Self-Attention Model for Object Tracking
    Abdelpakey, Mohamed H.
    Shehata, Mohamed S.
    Mohamed, Mostafa M.
    [J]. ADVANCES IN VISUAL COMPUTING, ISVC 2018, 2018, 11241 : 463 - 473
  • [2] [Anonymous], 2018, PROCEEDINGS OF THE I, DOI DOI 10.1109/CVPR.2018.00474
  • [3] Fully-Convolutional Siamese Networks for Object Tracking
    Bertinetto, Luca
    Valmadre, Jack
    Henriques, Joao F.
    Vedaldi, Andrea
    Torr, Philip H. S.
    [J]. COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 : 850 - 865
  • [4] LaSOT: A High-quality Benchmark for Large-scale Single Object Tracking
    Fan, Heng
    Lin, Liting
    Yang, Fan
    Chu, Peng
    Deng, Ge
    Yu, Sijia
    Bai, Hexin
    Xu, Yong
    Liao, Chunyuan
    Ling, Haibin
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5369 - 5378
  • [5] A Twofold Siamese Network for Real-Time Object Tracking
    He, Anfeng
    Luo, Chong
    Tian, Xinmei
    Zeng, Wenjun
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4834 - 4843
  • [6] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [7] HUANG L, 2019, Transactions on Pattern Analysis and Machine Intelligence
  • [8] Kristan M., EUROPEAN C COMPUTER
  • [9] SiamRPN plus plus : Evolution of Siamese Visual Tracking with Very Deep Networks
    Li, Bo
    Wu, Wei
    Wang, Qiang
    Zhang, Fangyi
    Xing, Junliang
    Yan, Junjie
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4277 - 4286
  • [10] High Performance Visual Tracking with Siamese Region Proposal Network
    Li, Bo
    Yan, Junjie
    Wu, Wei
    Zhu, Zheng
    Hu, Xiaolin
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8971 - 8980