Joint Spatial-Temporal Optimization for Stereo 3D Object Tracking

被引:14
|
作者
Li, Peiliang [1 ]
Shi, Jieqi [1 ]
Shen, Shaojie [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
关键词
D O I
10.1109/CVPR42600.2020.00691
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Directly learning multiple 3D objects motion from sequential images is difficult, while the geometric bundle adjustment lacks the ability to localize the invisible object centroid. To benefit from both the powerful object understanding skill from deep neural network meanwhile tackle precise geometry modeling for consistent trajectory estimation, we propose a joint spatial-temporal optimization-based stereo 3D object tracking method. From the network, we detect corresponding 2D bounding boxes on adjacent images and regress an initial 3D bounding box. Dense object cues (local depth and local coordinates) that associating to the object centroid are then predicted using a region-based network. Considering both the instant localization accuracy and motion consistency, our optimization models the relations between the object centroid and observed cues into a joint spatial-temporal error function. All historic cues will be summarized to contribute to the current estimation by a per-frame marginalization strategy without repeated computation. Quantitative evaluation on the KITTI tracking dataset shows our approach outperforms previous image-based 3D tracking methods by significant margins. We also report extensive results on multiple categories and larger datasets (KITTI raw and Argoverse Racking) for future benchmarking.
引用
收藏
页码:6876 / 6885
页数:10
相关论文
共 50 条
  • [31] Spatial-temporal graph Transformer for object tracking against noise interference
    Li, Ning
    Sang, Haiwei
    Zheng, Jiamin
    Ma, Huawei
    Wang, Xiaoying
    Xiao, Fu'an
    INFORMATION SCIENCES, 2024, 678
  • [32] Dynamic feature fusion with spatial-temporal context for robust object tracking
    Nai, Ke
    Li, Zhiyong
    Wang, Haidong
    PATTERN RECOGNITION, 2022, 130
  • [33] MASK GUIDED SPATIAL-TEMPORAL FUSION NETWORK FOR MULTIPLE OBJECT TRACKING
    Zhao, Shuangye
    Wu, Yubin
    Wang, Shuai
    Ke, Wei
    Sheng, Hao
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3231 - 3235
  • [34] Object tracking based on adaptive updating of a spatial-temporal context model
    Feng, Wanli
    Cen, Yigang
    Zeng, Xianyou
    Li, Zhetao
    Zeng, Ming
    Voronin, Viacheslav
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (11): : 5459 - 5473
  • [35] Deep Spatial-Temporal Joint Feature Representation for Video Object Detection
    Zhao, Baojun
    Zhao, Boya
    Tang, Linbo
    Han, Yuqi
    Wang, Wenzheng
    SENSORS, 2018, 18 (03)
  • [36] A hybrid domain enhanced framework for video retargeting with spatial-temporal importance and 3D grid optimization
    Wang, Jinqiao
    Xu, Min
    He, Xiangjian
    Lu, Hanqing
    Hoang, Doan
    SIGNAL PROCESSING, 2014, 94 : 33 - 47
  • [37] A 3D wavelet based spatial-temporal approach for video watermarking
    Li, Y
    Gao, XB
    Ji, HB
    ICCIMA 2003: FIFTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, PROCEEDINGS, 2003, : 260 - 265
  • [38] Joint Learning Spatial-Temporal Attention Correlation Filters for Aerial Tracking
    Zhao, Bo
    Ma, Sugang
    Zhao, Zhixian
    Zhang, Lei
    Hou, Zhiqiang
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 686 - 690
  • [39] Spatial light modulation for improved microscope stereo vision and 3D tracking
    Lee, Michael P.
    Gibson, Graham
    Tassieri, Manlio
    Phillips, Dave
    Bernet, Stefan
    Ritsch-Marte, Monika
    Padgett, Miles J.
    OPTICAL TRAPPING AND OPTICAL MICROMANIPULATION X, 2013, 8810
  • [40] Stereo3DMOT: Stereo Vision Based 3D Multi-object Tracking with Multimodal ReID
    Mao, Chen
    Tan, Chong
    Liu, Hong
    Hu, Jingqi
    Zheng, Min
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XII, 2024, 14436 : 495 - 507