Joint Spatial-Temporal Optimization for Stereo 3D Object Tracking

被引:14
|
作者
Li, Peiliang [1 ]
Shi, Jieqi [1 ]
Shen, Shaojie [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
关键词
D O I
10.1109/CVPR42600.2020.00691
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Directly learning multiple 3D objects motion from sequential images is difficult, while the geometric bundle adjustment lacks the ability to localize the invisible object centroid. To benefit from both the powerful object understanding skill from deep neural network meanwhile tackle precise geometry modeling for consistent trajectory estimation, we propose a joint spatial-temporal optimization-based stereo 3D object tracking method. From the network, we detect corresponding 2D bounding boxes on adjacent images and regress an initial 3D bounding box. Dense object cues (local depth and local coordinates) that associating to the object centroid are then predicted using a region-based network. Considering both the instant localization accuracy and motion consistency, our optimization models the relations between the object centroid and observed cues into a joint spatial-temporal error function. All historic cues will be summarized to contribute to the current estimation by a per-frame marginalization strategy without repeated computation. Quantitative evaluation on the KITTI tracking dataset shows our approach outperforms previous image-based 3D tracking methods by significant margins. We also report extensive results on multiple categories and larger datasets (KITTI raw and Argoverse Racking) for future benchmarking.
引用
收藏
页码:6876 / 6885
页数:10
相关论文
共 50 条
  • [21] Efficient Spatial-Temporal Information Fusion for LiDAR-Based 3D Moving Object Segmentation
    Sun, Jiadai
    Dai, Yuchao
    Zhang, Xianjing
    Xu, Jintao
    Ai, Rui
    Gu, Weihao
    Chen, Xieyuanli
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 11456 - 11463
  • [22] Real-time Spatial-temporal Context Approach for 3D Object Detection using LiDAR
    Kumar, K. S. Chidanand
    Al-Stouhi, Samir
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON VEHICLE TECHNOLOGY AND INTELLIGENT TRANSPORT SYSTEMS (VEHITS), 2020, : 432 - 439
  • [23] Spatial-temporal Concept based Explanation of 3D ConvNets
    Ji, Ying
    Wang, Yu
    Kato, Jien
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15444 - 15453
  • [24] Spatial-Temporal Transformer for 3D Point Cloud Sequences
    Wei, Yimin
    Liu, Hao
    Xie, Tingting
    Ke, Qiuhong
    Guo, Yulan
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 657 - 666
  • [25] Joint 3D Tracking of a Deformable Object in Interaction with a Hand
    Tsoli, Aggeliki
    Argyros, Antonis A.
    COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 504 - 520
  • [26] Joint optimization of detection and tracking for 3D sensors
    Wang, Guohong
    Mao, Shiyi
    He, You
    Wang, Xiaoqiang
    Yuhang Xuebao/Journal of Astronautics, 2001, 22 (06):
  • [27] Modeling of Multiple Spatial-Temporal Relations for Robust Visual Object Tracking
    Wang, Shilei
    Wang, Zhenhua
    Sun, Qianqian
    Cheng, Gong
    Ning, Jifeng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 5073 - 5085
  • [28] Visual object tracking by using ranking loss and spatial-temporal features
    Saribas, Hasan
    Cevikalp, Hakan
    Kahvecioglu, Sinem
    MACHINE VISION AND APPLICATIONS, 2023, 34 (02)
  • [29] SPRTracker: Learning Spatial-Temporal Pixel Aggregations for Multiple Object Tracking
    Liu, Jialin
    Kong, Jun
    Jiang, Min
    Liu, Tianshan
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2732 - 2736
  • [30] Multi-Object Tracking with Spatial-Temporal Embedding Perception and Multi-Task Synergistic Optimization
    Liang, Xiaoguo
    Li, Hui
    Cheng, Yuanzhi
    Chen, Shuangmin
    Liu, Hengyuan
    Computer Engineering and Applications, 2024, 60 (06) : 282 - 292