End-to-end Deep Object Tracking with Circular Loss Function for Rotated Bounding Box

Cited by: 0
Authors
Belyaev, Vladislav [1]
Malysheva, Aleksandra [1]
Shpilman, Aleksei [1]
Affiliations
[1] Natl Res Univ Higher Sch Econ, JetBrains Res, St Petersburg, Russia
Source
2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY) | 2019
Keywords
visual tracking; transformer; siamese networks;
DOI
10.1109/redundancy48165.2019.9003330
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
The task of object tracking is vital in numerous applications such as autonomous driving, intelligent surveillance, and robotics. It entails assigning a bounding box to an object in every frame of a video stream, given only the bounding box for that object in the first frame. In 2015, a new type of video object tracking (VOT) dataset was created that introduced rotated bounding boxes as an extension of axis-aligned ones. In this work, we introduce a novel end-to-end deep learning method based on the Transformer Multi-Head Attention architecture. We also present a new type of loss function, which takes into account the bounding box overlap and orientation. Our Deep Object Tracking model with Circular Loss Function (DOTCL) shows a considerable improvement in robustness over current state-of-the-art end-to-end deep learning models. It also outperforms state-of-the-art object tracking methods on the VOT2018 dataset in terms of the expected average overlap (EAO) metric.
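The abstract says the loss accounts for both bounding box overlap and orientation, but this record does not give its form. Purely as an illustration, and not the authors' DOTCL formulation, the sketch below pairs an overlap term with a periodic ("circular") angle term. Every name here (`circular_angle_loss`, `axis_aligned_iou`, `rotated_box_loss`, `angle_weight`), the `(cx, cy, w, h, theta)` box layout, the period of pi, and the use of axis-aligned IoU as a stand-in for rotated overlap are assumptions made for the example.

```python
import math


def circular_angle_loss(theta_pred: float, theta_true: float) -> float:
    """Periodic penalty on the orientation error, angles in radians.

    Assumption: a rectangle looks the same after a 180-degree turn, so the
    penalty has period pi. It is 0 for a match and 1 at 90 degrees off.
    """
    return 0.5 * (1.0 - math.cos(2.0 * (theta_pred - theta_true)))


def axis_aligned_iou(box_a, box_b) -> float:
    """IoU of two (cx, cy, w, h) boxes, ignoring rotation (a simplification)."""
    ax0, ax1 = box_a[0] - box_a[2] / 2, box_a[0] + box_a[2] / 2
    ay0, ay1 = box_a[1] - box_a[3] / 2, box_a[1] + box_a[3] / 2
    bx0, bx1 = box_b[0] - box_b[2] / 2, box_b[0] + box_b[2] / 2
    by0, by1 = box_b[1] - box_b[3] / 2, box_b[1] + box_b[3] / 2

    # Width and height of the intersection rectangle, clamped at zero.
    inter_w = max(0.0, min(ax1, bx1) - max(ax0, bx0))
    inter_h = max(0.0, min(ay1, by1) - max(ay0, by0))
    inter = inter_w * inter_h

    union = box_a[2] * box_a[3] + box_b[2] * box_b[3] - inter
    return inter / union if union > 0 else 0.0


def rotated_box_loss(pred, true, angle_weight: float = 1.0) -> float:
    """Hypothetical combined loss for (cx, cy, w, h, theta) boxes."""
    overlap_term = 1.0 - axis_aligned_iou(pred[:4], true[:4])
    angle_term = circular_angle_loss(pred[4], true[4])
    return overlap_term + angle_weight * angle_term
```

The point of the periodic term is that a prediction of 179 degrees against a target of 1 degree is treated as a small error, whereas a plain squared difference on the angle would treat it as a large one. A real tracker would likely replace the axis-aligned IoU with an overlap measure for rotated boxes.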
Pages: 165-170
Number of pages: 6
Related papers
50 in total
  • [1] FPDIoU Loss: A loss function for efficient bounding box regression of rotated object detection
    Ma, Siliang
    Xu, Yong
    IMAGE AND VISION COMPUTING, 2025, 154
  • [2] Object Bounding Transformed Network for End-to-End Semantic Segmentation
    Wang, Kuan-Chung
    Wang, Chien-Yao
    Tai, Tzu-Chiang
    Wang, Jia-Ching
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3217 - 3221
  • [3] RQFormer: Rotated Query Transformer for end-to-end oriented object detection
    Zhao, Jiaqi
    Ding, Zeyu
    Zhou, Yong
    Zhu, Hancheng
    Du, Wen-Liang
    Yao, Rui
    El Saddik, Abdulmotaleb
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 266
  • [4] Toward End-to-End Object Detection and Tracking on the Edge
    Tabkhi, Hamed
    SEC 2017: 2017 THE SECOND ACM/IEEE SYMPOSIUM ON EDGE COMPUTING (SEC'17), 2017,
  • [5] End-to-End Learning Deep CRF Models for Multi-Object Tracking Deep CRF Models
    Xiang, Jun
    Xu, Guohan
    Ma, Chao
    Hou, Jianhua
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (01) : 275 - 288
  • [6] End-to-end deep metric network for visual tracking
    Tian, Shengjing
    Shen, Shuwei
    Tian, Guoqiang
    Liu, Xiuping
    Yin, Baocai
    VISUAL COMPUTER, 2020, 36 (06) : 1219 - 1232
  • [7] End-to-end deep metric network for visual tracking
    Shengjing Tian
    Shuwei Shen
    Guoqiang Tian
    Xiuping Liu
    Baocai Yin
    The Visual Computer, 2020, 36 : 1219 - 1232
  • [8] End-to-end Active Object Tracking via Reinforcement Learning
    Luo, Wenhan
    Sun, Peng
    Zhong, Fangwei
    Liu, Wei
    Zhang, Tong
    Wang, Yizhou
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [9] MOTR: End-to-End Multiple-Object Tracking with Transformer
    Zeng, Fangao
    Dong, Bin
    Zhang, Yuang
    Wang, Tiancai
    Zhang, Xiangyu
    Wei, Yichen
    COMPUTER VISION - ECCV 2022, PT XXVII, 2022, 13687 : 659 - 675
  • [10] End-to-end Visual Object Tracking with Motion Saliency Guidance
    Zhang, Yucheng
    Liu, Kexin
    Wang, Tian
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 6566 - 6571