Online Multi-Scale Classification and Global Feature Modulation for Robust Visual Tracking

被引:3
作者
Gao, Qi [1 ]
Yin, Mingfeng [2 ]
Wu, Xiang [3 ]
Liu, Di [4 ]
Bo, Yuming [3 ]
机构
[1] Jiangsu Univ Technol, Coll Mech Engn, Changzhou 213001, Peoples R China
[2] Jiangsu Univ Technol, Sch Automobile & Traff Engn, Changzhou 213001, Peoples R China
[3] Nanjing Univ Sci & Technol, Sch Automat, Nanjing 210094, Peoples R China
[4] Nanjing Inst Technol, Sch Automat, Nanjing 211167, Peoples R China
基金
中国国家自然科学基金;
关键词
Visualization; Target tracking; Accuracy; Fuses; Modulation; Transformers; Real-time systems; Visual object tracking; coordinate attention; online multi-scale classification; global feature modulation; OBJECT TRACKING;
D O I
10.1109/TCSVT.2023.3343949
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recent advanced trackers, composed of discriminative classification and dedicated bounding box estimation, have achieved remarkable advancements in performance of visual object tracking. However, existing methods cannot satisfy the demands of tracking tasks in complex scenes, such as occlusion, scale variations, and etc. To this end, we propose a novel online multi-scale classification and global feature modulation for robust visual tracking, which is developed over accurate tracking by overlap maximization, named ATOM+. First, coordinate attention (CA) is applied to enhance the target features in the channel dimension and spatial dimension, which can effectively optimize the feature representation ability of the backbone network. Second, an online multi-scale classification (OMC) module is designed. During the online tracking phase, more reliable matching responses are comprehensively generated by aggregating information from different scales related to the target. This new operation enables stable perception of the target by the tracker, particularly when severe changes in the appearance and posture of the target are encountered. Third, a global feature modulation (GFM) mechanism is constructed, which requires only a small amount of computational resources, to fuse the spatial contextual information of the template image into the search region. This integration refines the bounding box to obtain an accurate estimate of the target state. Finally, comprehensive experiments on conventional tracking benchmarks of OTB100, LaSOT, and VOT2018 show that our tracker can sufficiently address different challenging scenarios, and achieves state-of-the-art performance. For the average running speed, our tracker can achieve 37 FPS in real time.
引用
收藏
页码:5321 / 5334
页数:14
相关论文
共 50 条
  • [41] Visual tracking via robust multi-task multi-feature joint sparse representation
    Wang, Yong
    Luo, Xinbin
    Ding, Lu
    Hu, Shiqiang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (23) : 31447 - 31467
  • [42] Transformer-Based Multi-Scale Feature Remote Sensing Image Classification Model
    Sun, Ting
    Li, Jun
    Zhou, Xiangrui
    Chen, Zan
    [J]. IEEE ACCESS, 2025, 13 : 34095 - 34104
  • [43] Global Context Attention for Robust Visual Tracking
    Choi, Janghoon
    [J]. SENSORS, 2023, 23 (05)
  • [44] Latent Subspace Projection Pursuit with Online Optimization for Robust Visual Tracking
    Liu, Risheng
    Jin, Wei
    Su, Zhixun
    Zhang, Changcheng
    [J]. IEEE MULTIMEDIA, 2014, 21 (04) : 47 - 55
  • [45] Spatial feature embedding for robust visual object tracking
    Liu, Kang
    Liu, Long
    Yang, Shangqi
    Fu, Zhihao
    [J]. IET COMPUTER VISION, 2024, 18 (04) : 540 - 556
  • [46] Visual tracking based on online sparse feature learning
    Wang, Zelun
    Wang, Jinjun
    Zhang, Shun
    Gong, Yihong
    [J]. IMAGE AND VISION COMPUTING, 2015, 38 : 24 - 32
  • [47] Enhancing Automatic Modulation Recognition Through Robust Global Feature Extraction
    Qu, Yunpeng
    Lu, Zhilin
    Zeng, Rui
    Wang, Jintao
    Wang, Jian
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (03) : 4192 - 4207
  • [48] Online Multi-Object Tracking With Visual and Radar Features
    Bae, Seung-Hwan
    [J]. IEEE ACCESS, 2020, 8 (08): : 90324 - 90339
  • [49] Adaptive Online Learning Based Robust Visual Tracking
    Yang, Weiming
    Zhao, Meirong
    Huang, Yinguo
    Zheng, Yelong
    [J]. IEEE ACCESS, 2018, 6 : 14790 - 14798
  • [50] Siamese Network with Channel-wise Attention and Multi-scale Fusion for Robust Object Tracking
    Tang, Eryong
    Wang, Yusheng
    Liu, Ye
    [J]. PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 6515 - 6520