Motion-to-Matching: A Mixed Paradigm for 3D Single Object Tracking

被引:1
|
作者
Li, Zhiheng [1 ,2 ,3 ]
Lin, Yu [1 ]
Cui, Yubo [1 ]
Li, Shuo [1 ]
Fang, Zheng [1 ,2 ,3 ]
机构
[1] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110819, Peoples R China
[2] Northeastern Univ, Natl Frontiers Sci Ctr Ind Intelligence & Syst Opt, Shenyang 110819, Peoples R China
[3] Northeastern Univ, Key Lab Data Analyt & Optimizat Smart Ind, Minist Educ, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Target tracking; Three-dimensional displays; Feature extraction; Point cloud compression; Transformers; Object tracking; Image matching; 3D object tracking; deep learning; point clouds;
D O I
10.1109/LRA.2023.3347143
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
3D single object tracking with LiDAR points is an important task in the computer vision field. Previous methods usually adopt the matching-based or motion-centric paradigms to estimate the current target status. However, the former is sensitive to the similar distractors and the sparseness of point clouds due to relying on appearance matching, while the latter usually focuses on short-term motion clues (eg. two frames) and ignores the long-term motion pattern of target. To address these issues, we propose a mixed paradigm with two stages, named MTM-Tracker, which combines motion modeling with feature matching into a single network. Specifically, in the first stage, we exploit the continuous historical boxes as motion prior and propose an encoder-decoder structure to locate target coarsely. Then, in the second stage, we introduce a feature interaction module to extract motion-aware features from consecutive point clouds and match them to refine target movement as well as regress other target states. Extensive experiments validate that our paradigm achieves competitive performance on large-scale datasets (70.9% in KITTI and 51.70% in NuScenes).
引用
收藏
页码:1468 / 1475
页数:8
相关论文
共 50 条
  • [31] 3D Object Representation Learning: A Set-to-Set Matching Perspective
    Yu, Tan
    Meng, Jingjing
    Yang, Ming
    Yuan, Junsong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2168 - 2179
  • [32] 3D Object Tracking Using Disparity Map
    Khan, Wasim Akram
    Pant, Dibakar Raj
    Adhikari, Bhisma
    Manandhar, Rasana
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2017, : 1015 - 1018
  • [33] SpOT: Spatiotemporal Modeling for 3D Object Tracking
    Stearns, Colton
    Rempe, Davis
    Li, Jie
    Ambrus, Rare
    Zakharov, Sergey
    Guizilini, Vitor
    Yang, Yanchao
    Guibas, Leonidas J.
    COMPUTER VISION, ECCV 2022, PT XXXVIII, 2022, 13698 : 639 - 656
  • [34] MCCA-MOT: Multimodal Collaboration-Guided Cascade Association Network for 3D Multi-Object Tracking
    Li, Hui
    Liu, Hengyuan
    Du, Zhenzhen
    Chen, Zhong
    Tao, Ye
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (01) : 974 - 989
  • [35] Density Aware 3D Object Single Stage Detector
    Ning, Jingmei
    Da, Feipeng
    Gai, Shaoyan
    IEEE SENSORS JOURNAL, 2021, 21 (20) : 23108 - 23117
  • [36] CSANet: Cuboid-Wise Shape Augmentation 3D Object Detector for Occluded Targets
    Lin, Jia
    Ge, Wancheng
    Rigoll, Gerhard
    Yin, Huilin
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1750 - 1754
  • [37] Super Sparse 3D Object Detection
    Fan, Lue
    Yang, Yuxue
    Wang, Feng
    Wang, Naiyan
    Zhang, Zhaoxiang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12490 - 12505
  • [38] SOFW: A Synergistic Optimization Framework for Indoor 3D Object Detection
    Dai, Kun
    Jiang, Zhiqiang
    Xie, Tao
    Wang, Ke
    Liu, Dedong
    Fan, Zhendong
    Li, Ruifeng
    Zhao, Lijun
    Omar, Mohamed
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 637 - 651
  • [39] DA-Net: Density-Aware 3D Object Detection Network for Point Clouds
    Wang, Shuhua
    Lu, Ke
    Xue, Jian
    Zhao, Yang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 665 - 678
  • [40] Extended Keyframe Detection with Stable Tracking for Multiple 3D Object Tracking
    Park, Youngmin
    Lepetit, Vincent
    Woo, Woontack
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2011, 17 (11) : 1728 - 1735