End-to-End Correlation Tracking With Enhanced Multi-Level Feature Fusion

被引:1
|
作者
Liu, Guangen [1 ]
Liu, Guizhong [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian 710049, Peoples R China
关键词
Target tracking; Correlation; Visualization; Semantics; Feature extraction; Fuses; Information filters; Visual tracking; correlation filters; deep features; multi-level feature fusion; OBJECT TRACKING;
D O I
10.1109/ACCESS.2021.3111532
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Discriminative correlation filters (DCF) have drawn increasing interest in visual tracking. In particular, a few recent works treat DCF as a special layer and add it into a Siamese network for visual tracking. However, most of them adopt shallow networks to learn target representations, which lack robust semantic information of deeper layers and make these works fail to handle significant appearance changes. In this paper, we design a novel Siamese network to fuse high-level semantic features and low-level spatial detail features for correlation tracking. Specifically, to introduce more semantic information into low-level features, we specially design a residual semantic embedding module to adaptively involve more semantic information from high-level features to guide the feature fusion. Furthermore, we adopt an effective and efficient channel attention mechanism to filter out noise information and make the network focus more on valuable features that are beneficial for visual tracking. The overall architecture is trained end-to-end offline to adaptively learn target representations, which are not only enabled to encode high-level semantic features and low-level spatial detail features, but also closely related to correlation filters. Experimental results on widely used OTB2013, OTB2015, VOT2016, TC-128, and UAV123 benchmarks show that our proposed tracker performs favorably against several state-of-the-art trackers.
引用
收藏
页码:128827 / 128840
页数:14
相关论文
共 50 条
  • [1] End-to-End Chained Pedestrian Multi-Object Tracking Based on Multi-Feature Fusion
    Zhou, Haiyun
    Xiang, Xuezhi
    Wang, Xinyao
    Ren, Wenkai
    PROCEEDINGS OF 2021 IEEE 12TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2021, : 150 - 153
  • [2] A Multi-level Acoustic Feature Extraction Framework for Transformer Based End-to-End Speech Recognition
    Li, Jin
    Su, Rongfeng
    Xie, Xurong
    Yan, Nan
    Wang, Lan
    INTERSPEECH 2022, 2022, : 3173 - 3177
  • [3] End-to-end feature fusion Siamese network for adaptive visual tracking
    Guo, Dongyan
    Wang, Jun
    Zhao, Weixuan
    Cui, Ying
    Wang, Zhenhua
    Chen, Shengyong
    IET IMAGE PROCESSING, 2021, 15 (01) : 91 - 100
  • [4] A Multi-Level Optimization Framework for End-to-End Text Augmentation
    Somayajula, Sai Ashish
    Song, Linfeng
    Xie, Pengtao
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 343 - 358
  • [5] End-to-End Feature Integration for Correlation Filter Tracking With Channel Attention
    Li, Dongdong
    Wen, Gongjian
    Kuai, Yangliu
    Porikli, Fatih
    IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (12) : 1815 - 1819
  • [6] An end-to-end multi-resolution feature fusion defogging network
    Ping Xue
    ShiXiong Deng
    Signal, Image and Video Processing, 2023, 17 : 4189 - 4197
  • [7] An end-to-end multi-resolution feature fusion defogging network
    Xue, Ping
    Deng, ShiXiong
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (08) : 4189 - 4197
  • [8] Multi-Level Modeling Units for End-to-End Mandarin Speech Recognition
    Yang, Yuting
    Du, Binbin
    Li, Yuke
    2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 175 - 179
  • [9] Multi-Modal Fusion for End-to-End RGB-T Tracking
    Zhang, Lichao
    Danelljan, Martin
    Gonzalez-Garcia, Abel
    van de Weijer, Joost
    Khan, Fahad Shahbaz
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2252 - 2261
  • [10] End-to-end relation extraction based on bootstrapped multi-level distant supervision
    He, Ying
    Li, Zhixu
    Yang, Qiang
    Chen, Zhigang
    Liu, An
    Zhao, Lei
    Zhou, Xiaofang
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2020, 23 (05): : 2933 - 2956