End-to-End Correlation Tracking With Enhanced Multi-Level Feature Fusion

被引:1
|
作者
Liu, Guangen [1 ]
Liu, Guizhong [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian 710049, Peoples R China
关键词
Target tracking; Correlation; Visualization; Semantics; Feature extraction; Fuses; Information filters; Visual tracking; correlation filters; deep features; multi-level feature fusion; OBJECT TRACKING;
D O I
10.1109/ACCESS.2021.3111532
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Discriminative correlation filters (DCF) have drawn increasing interest in visual tracking. In particular, a few recent works treat DCF as a special layer and add it into a Siamese network for visual tracking. However, most of them adopt shallow networks to learn target representations, which lack robust semantic information of deeper layers and make these works fail to handle significant appearance changes. In this paper, we design a novel Siamese network to fuse high-level semantic features and low-level spatial detail features for correlation tracking. Specifically, to introduce more semantic information into low-level features, we specially design a residual semantic embedding module to adaptively involve more semantic information from high-level features to guide the feature fusion. Furthermore, we adopt an effective and efficient channel attention mechanism to filter out noise information and make the network focus more on valuable features that are beneficial for visual tracking. The overall architecture is trained end-to-end offline to adaptively learn target representations, which are not only enabled to encode high-level semantic features and low-level spatial detail features, but also closely related to correlation filters. Experimental results on widely used OTB2013, OTB2015, VOT2016, TC-128, and UAV123 benchmarks show that our proposed tracker performs favorably against several state-of-the-art trackers.
引用
收藏
页码:128827 / 128840
页数:14
相关论文
共 50 条
  • [31] Multi-level temporal feature fusion with feature exchange strategy for multiple object tracking
    Ge, Yisu
    Ye, Wenjie
    Zhang, Guodao
    Lin, Mengying
    OPTOELECTRONICS LETTERS, 2024, 20 (08) : 505 - 512
  • [32] Multi-level temporal feature fusion with feature exchange strategy for multiple object tracking
    GE Yisu
    YE Wenjie
    ZHANG Guodao
    LIN Mengying
    Optoelectronics Letters, 2024, 20 (08) : 505 - 512
  • [33] Tracking Ransomware End-to-end
    Huang, Danny Yuxing
    Aliapoulios, Maxwell Matthaios
    Li, Vector Guo
    Invernizzi, Luca
    McRoberts, Kylie
    Bursztein, Elie
    Levin, Jonathan
    Levchenko, Kirill
    Snoeren, Alex C.
    McCoy, Damon
    2018 IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP), 2018, : 618 - 631
  • [34] QKD Key Provisioning With Multi-Level Pool Slicing for End-to-End Security Services in Optical Networks
    Zhu, Qingcheng
    Yu, Xiaosong
    Zhao, Yongli
    Nag, Avishek
    Zhang, Jie
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (02): : 2153 - 2169
  • [35] End-to-end Flow Correlation Tracking with Spatial-temporal Attention
    Zhu, Zheng
    Wu, Wei
    Zou, Wei
    Yan, Junjie
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 548 - 557
  • [36] SPCNet: Scale Position Correlation Network for End-to-End Visual Tracking
    Wang, Qiang
    Gao, Jin
    Zhang, Mengdan
    Xing, Junliang
    Hu, Weiming
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1803 - 1808
  • [37] INTERACTIVE FEATURE FUSION FOR END-TO-END NOISE-ROBUST SPEECH RECOGNITION
    Hu, Yuchen
    Hou, Nana
    Chen, Chen
    Chng, Eng Siong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6292 - 6296
  • [38] Feature Fusion Strategies for End-to-End Evaluation of Cognitive Behavior Therapy Sessions
    Chen, Zhuohao
    Flemotomos, Nikolaos
    Ardulov, Victor
    Creed, Torrey A.
    Imel, Zac E.
    Atkins, David C.
    Narayanan, Shrikanth
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 1836 - 1839
  • [39] End-to-End Bloody Video Recognition by Audio-Visual Feature Fusion
    Hou, Congcong
    Wu, Xiaoyu
    Wang, Ge
    PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT I, 2018, 11256 : 501 - 510
  • [40] CrossFuser: Multi-Modal Feature Fusion for End-to-End Autonomous Driving Under Unseen Weather Conditions
    Wu, Weishang
    Deng, Xiaoheng
    Jiang, Ping
    Wan, Shaohua
    Guo, Yuanxiong
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (12) : 14378 - 14392