Exploiting weak mask representation with convolutional neural networks for accurate object tracking

被引:0
|
作者
Jianglei Huang
Wengang Zhou
Qi Tian
Houqiang Li
机构
[1] University of Science and Technology of China,CAS Key Laboratory of Technology in GIPAS, EEIS Department
[2] University of Texas at San Antonio,undefined
来源
关键词
Object tracking; Deep learning; Mask representation; Data augmentation; Bounding box approximation;
D O I
暂无
中图分类号
学科分类号
摘要
Recent years have witnessed the popularity of Convolutional Neural Networks (CNN) in a variety of computer vision tasks, including video object tracking. Existing object tracking methods with CNN employ either a scalar score or a confidence map as CNN’s output, which suffer the infeasibility of estimating the object’s accurate scale and rotation angle. Specifically, as with other traditional methods, they assume the targets’ scale aspect ratio and rotation angle are fixed. To address the limitation, we propose to take a binary mask as the output of CNN for tracking. To this end, we adapt a semantic segmentation model by online fine-tuning with augmented samples in the initial frame to uncover the target in the following frames. During the generation of training samples, we employ a Crop and Paste method to better utilize context information, add a random value to lightness component to mimic the illumination change, and take a Gaussian filtering approach to mimic the blur. During the tracking, due to the limitation of CNN’s receptive field size and spatial resolution, the network may fail to identify the target if the estimated bounding box is considerably incorrect. Therefore we propose a bounding box approximation method by considering temporal consistency. Excluding the initial training cost, our tracker runs at 41 FPS on a single GeForce 1080Ti GPU. Evaluated on benchmarks including OTB-2015, VOT-2016 and TempleColor, it achieves comparable results with non real-time top trackers and state-of-the-art performance among those real-time ones.
引用
收藏
页码:20961 / 20985
页数:24
相关论文
共 50 条
  • [21] Object Tracking Using Deep Convolutional Neural Networks and Visual Appearance Models
    Mocanu, Bogdan
    Tapu, Ruxandra
    Zaharia, Titus
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS (ACIVS 2017), 2017, 10617 : 114 - 125
  • [22] Exploiting Cyclic Symmetry in Convolutional Neural Networks
    Dieleman, Sander
    De Fauw, Jeffrey
    Kavukcuoglu, Koray
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [23] Exploiting Weak Ties in Incomplete Network Datasets Using Simplified Graph Convolutional Neural Networks
    Bidoki, Neda H.
    Mantzaris, Alexander, V
    Sukthankar, Gita
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2020, 2 (02):
  • [24] Towards a fast and accurate road object detection algorithm based on convolutional neural networks
    Zhang, Qinghui
    Wan, Chenxia
    Han, Weiliang
    Bian, Shanfeng
    JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (05)
  • [25] VISUAL OBJECT TRACKING VIA GRAPH CONVOLUTIONAL REPRESENTATION
    Tu, Zhengzheng
    Zhou, Ajian
    Jiang, Bo
    Luo, Bin
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 234 - 239
  • [26] Accurate visual representation learning for single object tracking
    Bao, Hua
    Shu, Ping
    Wang, Qijun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 24059 - 24079
  • [27] Accurate visual representation learning for single object tracking
    Hua Bao
    Ping Shu
    Qijun Wang
    Multimedia Tools and Applications, 2022, 81 : 24059 - 24079
  • [28] Online Tracking with Convolutional Neural Networks
    Liu, Xiaodong
    Zhou, Yue
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT III, 2017, 10636 : 208 - 216
  • [29] Fast Multi-Object Tracking Using Convolutional Neural Networks with Tracklets Updating
    Zhang, Yuanping
    Tang, Yuanyan
    Fang, Bin
    Shang, Zhaowei
    2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 313 - 317
  • [30] Multiple object tracking based on appearance and motion graph convolutional neural networks with an explainer
    Zhang Y.
    Huang Q.
    Zheng L.
    Neural Computing and Applications, 2024, 36 (22) : 13799 - 13814