End-to-End Correlation Tracking With Enhanced Multi-Level Feature Fusion

被引:1
|
作者
Liu, Guangen [1 ]
Liu, Guizhong [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian 710049, Peoples R China
关键词
Target tracking; Correlation; Visualization; Semantics; Feature extraction; Fuses; Information filters; Visual tracking; correlation filters; deep features; multi-level feature fusion; OBJECT TRACKING;
D O I
10.1109/ACCESS.2021.3111532
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Discriminative correlation filters (DCF) have drawn increasing interest in visual tracking. In particular, a few recent works treat DCF as a special layer and add it into a Siamese network for visual tracking. However, most of them adopt shallow networks to learn target representations, which lack robust semantic information of deeper layers and make these works fail to handle significant appearance changes. In this paper, we design a novel Siamese network to fuse high-level semantic features and low-level spatial detail features for correlation tracking. Specifically, to introduce more semantic information into low-level features, we specially design a residual semantic embedding module to adaptively involve more semantic information from high-level features to guide the feature fusion. Furthermore, we adopt an effective and efficient channel attention mechanism to filter out noise information and make the network focus more on valuable features that are beneficial for visual tracking. The overall architecture is trained end-to-end offline to adaptively learn target representations, which are not only enabled to encode high-level semantic features and low-level spatial detail features, but also closely related to correlation filters. Experimental results on widely used OTB2013, OTB2015, VOT2016, TC-128, and UAV123 benchmarks show that our proposed tracker performs favorably against several state-of-the-art trackers.
引用
收藏
页码:128827 / 128840
页数:14
相关论文
共 50 条
  • [11] End-to-end response selection based on multi-level context response matching
    Boussaha, Basma El Amel
    Hernandez, Nicolas
    Jacquin, Christine
    Morin, Emmanuel
    COMPUTER SPEECH AND LANGUAGE, 2020, 63
  • [12] End-to-end relation extraction based on bootstrapped multi-level distant supervision
    Ying He
    Zhixu Li
    Qiang Yang
    Zhigang Chen
    An Liu
    Lei Zhao
    Xiaofang Zhou
    World Wide Web, 2020, 23 : 2933 - 2956
  • [13] End-to-End Feature Decontaminated Network for UAV Tracking
    Zuo, Haobo
    Fu, Changhong
    Li, Sihang
    Ye, Junjie
    Zheng, Guangze
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 12130 - 12137
  • [14] End-to-End Multi-View Fusion for Enhanced Perception and Motion Prediction
    Khalil, Yasser H.
    Mouftah, Hussein T.
    2021 IEEE 94TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-FALL), 2021,
  • [15] Band Regrouping and Response-Level Fusion for End-to-End Hyperspectral Object Tracking
    Ouyang, Er
    Wu, Jianhui
    Li, Bin
    Zhao, Lin
    Hu, Wenjing
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [16] An end-to-end tracking framework via multi-view and temporal feature aggregation
    Yang, Yihan
    Xu, Ming
    Ralph, Jason F.
    Ling, Yuchen
    Pan, Xiaonan
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 249
  • [17] DEEP FEATURE BASED END-TO-END TRANSPORTATION NETWORK FOR MULTI-TARGET TRACKING
    Ullah, Mohib
    Cheikh, Faouzi Alaya
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 3738 - 3742
  • [18] An End-to-End Multi-Level Wavelet Convolutional Neural Networks for heart diseases diagnosis
    El Bouny, Lahcen
    Khalil, Mohammed
    Adib, Abdellah
    NEUROCOMPUTING, 2020, 417 : 187 - 201
  • [19] MULTI-LEVEL LANGUAGE MODELING AND DECODING FOR OPEN VOCABULARY END-TO-END SPEECH RECOGNITION
    Hori, Takaaki
    Watanabe, Shinji
    Hershey, John R.
    2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 287 - 293
  • [20] Triplet Network with Multi-level Feature Fusion for Object Tracking
    Cao, Yang
    Wan, Bo
    Wang, Quan
    Cheng, Fei
    2020 JOINT 9TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2020 4TH INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR), 2020,