Visual object tracking by using ranking loss and spatial-temporal features

被引:0
|
作者
Saribas, Hasan [2 ]
Cevikalp, Hakan [1 ]
Kahvecioglu, Sinem [3 ]
机构
[1] Eskisehir Osmagazi Univ, Machine Learning & Comp Vis Lab, Elect & Elect Engn, Eskisehir, Turkiye
[2] Huawei Turkey R&D Ctr, Istanbul, Turkiye
[3] Eskisehir Tech Univ, Fac Aeronaut & Astronaut, Dept Avion, Eskisehir, Turkiye
关键词
Object tracking; Ranking loss; Two-stream network; Temporal features;
D O I
10.1007/s00138-023-01381-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a novel two-stream deep neural network tracker for robust object tracking. In the proposed network, we use both spatial and temporal features and employ a novel loss function called ranking loss. The class confidence scores coming from the two-stream (spatial and temporal) networks are fused at the end for final decision. Using ranking loss in the proposed tracker enforces the networks to learn giving higher scores to the candidate regions that frame the target object better. As a result, the tracker returns more precise bounding boxes framing the target object, and the risk of tracking error accumulation and drifts are largely mitigated when the proposed network architecture is used with a simple yet effective model update rule. We conducted extensive experiments on six different benchmarks, including OTB-2015, VOT-2017, TC-128, DTB70, NfS and UAV123. Our proposed tracker achieves the state-of-the-art results on the most of the tested challenging tracking datasets. Especially, our results on the OTB-2015, DTB70, NfS and TC-128 datasets are very promising.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Dual integration of multi-model with spatial-temporal occlusion-awareness for visual object tracking
    Wang, Fei
    Liu, Guixi
    Zhang, Yi
    IET IMAGE PROCESSING, 2022, 16 (11) : 2890 - 2910
  • [22] Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking
    Li, Feng
    Tian, Cheng
    Zuo, Wangmeng
    Zhang, Lei
    Yang, Ming-Hsuan
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4904 - 4913
  • [23] Spatial-temporal initialization dilemma: towards realistic visual tracking
    Chang Liu
    Yongsheng Yuan
    Xin Chen
    Huchuan Lu
    Dong Wang
    Visual Intelligence, 2 (1):
  • [24] STGL: Spatial-Temporal Graph Representation and Learning for Visual Tracking
    Jiang, Bo
    Zhang, Yuan
    Luo, Bin
    Cao, Xiaochun
    Tang, Jin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2162 - 2171
  • [25] Convolution operators for visual tracking based on spatial-temporal regularization
    Wang, Peng
    Sun, Mengyu
    Wang, Haiyan
    Li, Xiaoyan
    Yang, Yongxia
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (10): : 5339 - 5351
  • [26] LEARNING SPATIAL-TEMPORAL CONSISTENT CORRELATION FILTER FOR VISUAL TRACKING
    Lou, Han
    Wang, Dongfei
    Jiang, Zhuqing
    Men, Aidong
    Zhou, Yun
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
  • [27] SPRTracker: Learning Spatial-Temporal Pixel Aggregations for Multiple Object Tracking
    Liu, Jialin
    Kong, Jun
    Jiang, Min
    Liu, Tianshan
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2732 - 2736
  • [28] Spatial-temporal graph Transformer for object tracking against noise interference
    Li, Ning
    Sang, Haiwei
    Zheng, Jiamin
    Ma, Huawei
    Wang, Xiaoying
    Xiao, Fu'an
    INFORMATION SCIENCES, 2024, 678
  • [29] Dynamic feature fusion with spatial-temporal context for robust object tracking
    Nai, Ke
    Li, Zhiyong
    Wang, Haidong
    PATTERN RECOGNITION, 2022, 130
  • [30] MASK GUIDED SPATIAL-TEMPORAL FUSION NETWORK FOR MULTIPLE OBJECT TRACKING
    Zhao, Shuangye
    Wu, Yubin
    Wang, Shuai
    Ke, Wei
    Sheng, Hao
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3231 - 3235