Evota: an enhanced visual object tracking network with attention mechanism

被引:0
|
作者
An Zhao
Yi Zhang
机构
[1] Sichuan University,Department of Computer Science
来源
关键词
Attention mechanism; Visual tracking; Transformer;
D O I
暂无
中图分类号
学科分类号
摘要
Transformer architecture has made breakthrough in various downstream computer vision tasks and has shown its great potential in visual object tracking. However, existing transformer-based approaches adopt pixel-to-pixel attention strategy to integrate the domain knowledge, but fail to explore the channel and location information from object features, which limits the expressivity of the tracker. To address the above problems, we propose a novel tracking framework, where we propose 2 attention blocks that fuses with Transformer (dubbed EVOTA). It has 4 modules: the feature extraction module, the enhanced attention module, a transformer module and a model predictor. Specifically, a channel-wise attention module re-calibrates the channel-wise feature responses in an adaptive way by modelling interdependencies explicitly between channels. A local cross-channel interaction scheme learns strong channel context information. Meanwhile, an energy function is developed to analyze the importance of each neuron and infers their 3D weights. Extensive experiments have been carried out on 5 prevalent tracking benchmarks to testify the effectiveness of our model, in which EVOTA outperforms several state-of-the-art methods.
引用
收藏
页码:24939 / 24960
页数:21
相关论文
共 50 条
  • [41] Siamese Feedback Network for Visual Object Tracking
    Gwon M.-G.
    Kim J.
    Um G.-M.
    Lee H.
    Seo J.
    Lim S.Y.
    Yang S.-J.
    Kim W.
    IEIE Transactions on Smart Processing and Computing, 2022, 11 (01): : 24 - 33
  • [42] Online Siamese Network for Visual Object Tracking
    Chang, Shuo
    Li, Wei
    Zhang, Yifan
    Feng, Zhiyong
    SENSORS, 2019, 19 (08)
  • [43] SiamAtt: Siamese attention network for visual tracking
    Yang, Kai
    He, Zhenyu
    Zhou, Zikun
    Fan, Nana
    KNOWLEDGE-BASED SYSTEMS, 2020, 203
  • [44] EANTrack: An Efficient Attention Network for Visual Tracking
    Gu, Fengwei
    Lu, Jun
    Cai, Chengtao
    Zhu, Qidan
    Ju, Zhaojie
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) : 5911 - 5928
  • [45] Spatial and Channel Attention Mechanism Method for Object Tracking
    Liu Jiamin
    Xie Wenjie
    Huang Hong
    Tang Yiming
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (09) : 2569 - 2576
  • [46] Object tracking based on Siamese networks and attention mechanism
    Yan, Zhengbang
    Quan, Wenjun
    Yang, Congxian
    Wang, Wei
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (05)
  • [47] Adaptive object tracking based on spatial attention mechanism
    Xie Y.
    Chen Y.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2019, 41 (09): : 1945 - 1954
  • [48] A Visual Tracking Algorithm Combining Parallel Network and Dual Attention-Aware Mechanism
    Ge, Haibo
    Wang, Shuxian
    Huang, Chaofeng
    An, Yu
    IEEE ACCESS, 2023, 11 : 15831 - 15844
  • [49] Learning attention for object tracking with adversarial learning network
    Cheng, Xu
    Song, Chen
    Gu, Yongxiang
    Chen, Beijing
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2020, 2020 (01)
  • [50] Learning attention for object tracking with adversarial learning network
    Xu Cheng
    Chen Song
    Yongxiang Gu
    Beijing Chen
    EURASIP Journal on Image and Video Processing, 2020