MLVSNet: Multi-level Voting Siamese Network for 3D Visual Tracking

被引:33
作者
Wang, Zhoutao [1 ]
Xie, Qian [1 ]
Lai, Yu-Kun [2 ]
Wu, Jing [2 ]
Long, Kun [1 ]
Wang, Jun [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Nanjing, Peoples R China
[2] Cardiff Univ, Cardiff, S Glam, Wales
来源
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICCV48922.2021.00309
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Benefiting from the excellent performance of Siamese-based trackers, huge progress on 2D visual tracking has been achieved. However, 3D visual tracking is still under-explored. Inspired by the idea of Hough voting in 3D object detection, in this paper, we propose a Multi-level Voting Siamese Network (MLVSNet) for 3D visual tracking from outdoor point cloud sequences. To deal with sparsity in outdoor 3D point clouds, we propose to perform Hough voting on multi-level features to get more vote centers and retain more useful information, instead of voting only on the final level feature as in previous methods. We also design an efficient and lightweight Target-Guided Attention (TGA) module to transfer the target information and highlight the target points in the search area. Moreover, we propose a Vote-cluster Feature Enhancement (VFE) module to exploit the relationships between different vote clusters. Extensive experiments on the 3D tracking benchmark of KITTI dataset demonstrate that our MLVSNet outperforms state-of-the-art methods with significant margins. Code will be available at https://github.com/CodeWZT/MLVSNet.
引用
收藏
页码:3081 / 3090
页数:10
相关论文
共 48 条
  • [21] SiamRPN plus plus : Evolution of Siamese Visual Tracking with Very Deep Networks
    Li, Bo
    Wu, Wei
    Wang, Qiang
    Zhang, Fangyi
    Xing, Junliang
    Yan, Junjie
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4277 - 4286
  • [22] High Performance Visual Tracking with Siamese Region Proposal Network
    Li, Bo
    Yan, Junjie
    Wu, Wei
    Zhu, Zheng
    Hu, Xiaolin
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8971 - 8980
  • [23] Deep visual tracking: Review and experimental comparison
    Li, Peixia
    Wang, Dong
    Wang, Lijun
    Lu, Huchuan
    [J]. PATTERN RECOGNITION, 2018, 76 : 323 - 338
  • [24] Li WC, 2017, INT CONF ACOUST SPEE, P3156, DOI 10.1109/ICASSP.2017.7952738
  • [25] Liu Y., 2018, TMM, V21, P664
  • [26] Visual attention feature (VAF) : A novel strategy for visual tracking based on cloud platform in intelligent surveillance systems
    Pan, Zheng
    Liu, Shuai
    Sangaiah, Arun Kumar
    Muhammad, Khan
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 120 : 182 - 194
  • [27] Park J., 2018, arXiv preprint arXiv:1807.06514
  • [28] Pieropan A, 2015, IEEE INT CONF ROBOT, P2410, DOI 10.1109/ICRA.2015.7139520
  • [29] ImVoteNet: Boosting 3D Object Detection in Point Clouds with Image Votes
    Qi, Charles R.
    Chen, Xinlei
    Litany, Or
    Guibas, Leonidas J.
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4403 - 4412
  • [30] Deep Hough Voting for 3D Object Detection in Point Clouds
    Qi, Charles R.
    Litany, Or
    He, Kaiming
    Guibas, Leonidas J.
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9276 - 9285