Siamese Tracking Network with Multi-attention Mechanism

Citations: 0
Authors
Xu, Yuzhuo [1]
Li, Ting [1]
Zhu, Bing [2]
Wang, Fasheng [1]
Sun, Fuming [1]
Affiliations
[1] Dalian Minzu Univ, Sch Informat & Commun Engn, Liaohexi Rd, Dalian 116600, Liaoning, Peoples R China
[2] Harbin Inst Technol, Dept Informat Engn, Xidazhi St, Harbin 150006, Heilongjiang, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Object tracking; Feature representation; Multi-scale feature fusion; Transformer; Multi-attention mechanism; Visual tracking
DOI
10.1007/s11063-024-11670-5
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Object trackers based on Siamese networks treat tracking as a similarity-matching process. However, the correlation operation is a local linear matching process, which limits the tracker's ability to capture the intricate nonlinear relationship between the template and search-region branches. Moreover, most trackers do not update the template and often use the first frame as the initial template, which easily leads to poor tracking performance when the target undergoes deformation, scale variation, or occlusion. To this end, we propose a Siamese tracking network with a multi-attention mechanism, consisting of a template branch and a search branch. To adapt to changes in target appearance, we integrate dynamic templates and multi-attention mechanisms in the template branch, obtaining a more effective feature representation by fusing the features of the initial and dynamic templates. To enhance the robustness of the tracking model, we use a multi-attention mechanism in the search branch that shares weights with the template branch, obtaining a multi-scale feature representation by fusing search-region features at different scales. In addition, we design a simple, lightweight feature fusion mechanism in which a Transformer encoder structure fuses the information of the template and search areas, and the dynamic template is updated online based on confidence. Experimental results on public tracking datasets show that the proposed method achieves competitive results compared with several state-of-the-art trackers.
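The confidence-based online update of the dynamic template mentioned in the abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the paper: the class `DynamicTemplateBank`, the threshold value 0.7, and the element-wise mean used as a stand-in for fusion (the paper fuses features with multi-attention and a Transformer encoder, which this toy example does not reproduce).

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class DynamicTemplateBank:
    """Hypothetical holder for a fixed initial template and a dynamic one."""
    initial: List[float]
    dynamic: List[float]
    threshold: float = 0.7  # assumed confidence threshold, not from the paper

    def maybe_update(self, candidate: Sequence[float], confidence: float) -> bool:
        """Replace the dynamic template only when the tracker is confident."""
        if confidence >= self.threshold:
            self.dynamic = list(candidate)
            return True
        return False

    def fused(self) -> List[float]:
        """Toy fusion: element-wise mean of initial and dynamic features."""
        return [(a + b) / 2.0 for a, b in zip(self.initial, self.dynamic)]


bank = DynamicTemplateBank(initial=[1.0, 0.0], dynamic=[1.0, 0.0])
low = bank.maybe_update([0.0, 1.0], confidence=0.3)   # rejected, e.g. during occlusion
high = bank.maybe_update([0.0, 1.0], confidence=0.9)  # accepted
fused = bank.fused()
```

The initial template is never overwritten, so a bad update can only affect the dynamic template; gating on confidence is one simple way to avoid absorbing an occluded or drifted target into the template.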
Pages: 23