Siamese visual tracking based on criss-cross attention and improved head network

被引:24
|
作者
Zhang, Jianming [1 ,2 ]
Huang, Haitao [1 ,2 ]
Jin, Xiaokang [3 ]
Kuang, Li-Dan [2 ]
Zhang, Jin [2 ]
机构
[1] Changsha Univ Sci & Technol, Key Lab Safety Control Bridge Engn, Minist Educ, Changsha 410114, Peoples R China
[2] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Peoples R China
[3] Jinhua Adv Res Inst, Jinhua 321013, Peoples R China
基金
中国国家自然科学基金;
关键词
Visual tracking; Siamese network; Deep learning; Attention mechanism; Anchor-free; Center-ness; OBJECT; ROBUST;
D O I
10.1007/s11042-023-15429-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The efficient Siamese anchor-free tracker has fewer parameters, but it produces a large number of low-quality bounding boxes which are located far away from the center of the object. Moreover, a plenty of background information or distractors also interfere with the tracking process, resulting in the inaccurate results of classification and regression. As such, we propose a novel Siamese anchor-free network based on criss-cross attention and an improved head network. We apply ResNet-50 to extract the features of the template image and search region, then feed the feature maps into a recurrent criss-cross attention module to make it more discriminative. The enhanced feature maps are inputted into our improved head network, which include the center-ness branch based on the original classification and regression branches to filter out low-quality bounding boxes. Our proposed tracker reduces the impact of background information or distractors and can obtain high-quality bounding boxes, generating more accurate and robust tracking results. Extensive experiments and comparisons with state-of-the-art trackers are conducted on many challenging benchmarks such as VOT2016, VOT2018, GOT-10k, UAV123 and OTB2015. Our tracker achieves excellent performance with a considerable real-time speed.
引用
收藏
页码:1589 / 1615
页数:27
相关论文
共 50 条
  • [1] Siamese visual tracking based on criss-cross attention and improved head network
    Jianming Zhang
    Haitao Huang
    Xiaokang Jin
    Li-Dan Kuang
    Jin Zhang
    Multimedia Tools and Applications, 2024, 83 : 1589 - 1615
  • [2] Criss-Cross Attentional Siamese Networks for Object Tracking
    Wang, Zhangdong
    Qin, Jiaohua
    Xiang, Xuyu
    Tan, Yun
    Xiong, Neal N.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (02): : 2931 - 2946
  • [3] ATCC: Accurate tracking by criss-cross location attention
    Wu, Yong
    Liu, Zhi
    Zhou, Xiaofei
    Ye, Linwei
    Wang, Yang
    IMAGE AND VISION COMPUTING, 2021, 111
  • [4] Siamese Network Based on MLP and Multi-head Cross Attention for Visual Object Tracking
    Li, Piaoyang
    Lan, Shiyong
    Sun, Shipeng
    Wang, Wenwu
    Gao, Yongyang
    Yang, Yongyu
    Yu, Guangyu
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PART X, 2023, 14263 : 420 - 431
  • [5] Visual software defect prediction method based on improved recurrent criss-cross residual network
    Chen, Liqiong
    Yunjie, Lei
    Huaiying, Sun
    International Journal of Web Information Systems, 20 (06): : 621 - 638
  • [6] Visual software defect prediction method based on improved recurrent criss-cross residual network
    Chen, Liqiong
    Yunjie, Lei
    Huaiying, Sun
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2024, 20 (06) : 621 - 638
  • [7] An Improved Siamese Tracking Network Based On Self-Attention And Cross-Attention
    Lai Yijun
    Song Jianmei
    She Haoping
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 466 - 470
  • [8] Visual Tracking With Siamese Network Based on Fast Attention Network
    Qin, Lin
    Yang, Yang
    Huang, Dandan
    Zhu, Naibo
    Yang, Han
    Xu, Zhisong
    IEEE ACCESS, 2022, 10 : 35632 - 35642
  • [9] CCNet: Criss-Cross Attention for Semantic Segmentation
    Huang, Zilong
    Wang, Xinggang
    Huang, Lichao
    Huang, Chang
    Wei, Yunchao
    Liu, Wenyu
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 603 - 612
  • [10] CCNet: Criss-Cross Attention for Semantic Segmentation
    Huang, Zilong
    Wang, Xinggang
    Wei, Yunchao
    Huang, Lichao
    Shi, Humphrey
    Liu, Wenyu
    Huang, Thomas S.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 6896 - 6908