DeepKalPose: An enhanced deep-learning Kalman filter for temporally consistent monocular vehicle pose estimation

被引:0
作者
Di Bella, Leandro [1 ,2 ]
Lyu, Yangxintong [1 ,2 ]
Munteanu, Adrian [1 ,2 ]
机构
[1] Vrije Univ Brussel, Dept Elect & Informat, Brussels, Belgium
[2] IMEC, Leuven, Belgium
关键词
artificial intelligence; Kalman filters; pose estimation;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents DeepKalPose, a novel approach for enhancing temporal consistency in monocular vehicle pose estimation applied on video through a deep-learning-based Kalman filter. By integrating a bi-directional Kalman filter strategy utilizing forward and backward time-series processing, combined with a learnable motion model to represent complex motion patterns, the method significantly improves pose accuracy and robustness across various conditions, particularly for occluded or distant vehicles. Experimental validation on the KITTI dataset confirms that DeepKalPose outperforms existing methods in both pose accuracy and temporal consistency. In this paper, we propose a deep-learning-based Kalman filter to enhance temporal consistency in monocular vehicle pose estimation applied on video. A learnable motion model is integrated into a novel bi-directional time-series processing module, significantly improving pose accuracy and temporal coherency. The method demonstrates robustness across various conditions, particularly for occluded or distant vehicles, providing the potential for real-world applications. image
引用
收藏
页数:4
相关论文
共 26 条
  • [1] Brazil Garrick, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12368), P135, DOI 10.1007/978-3-030-58592-1_9
  • [2] Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking
    Cao, Jinkun
    Pang, Jiangmiao
    Weng, Xinshuo
    Khirodkar, Rawal
    Kitani, Kris
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9686 - 9696
  • [3] Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
  • [4] Leveraging Shape Completion for 3D Siamese Tracking
    Giancola, Silvio
    Zarzar, Jesus
    Ghanem, Bernard
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1359 - 1368
  • [5] Model-Based Vehicle Pose Estimation and Tracking in Videos Using Random Forests
    Hoedlmoser, Michael
    Micusik, Branislav
    Pollefeys, Marc
    Liu, Ming-Yu
    Kampel, Martin
    [J]. 2013 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2013), 2013, : 430 - 437
  • [6] Monocular Quasi-Dense 3D Object Tracking
    Hu, Hou-Ning
    Yang, Yung-Hsu
    Fischer, Tobias
    Darrell, Trevor
    Yu, Fisher
    Sun, Min
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 1992 - 2008
  • [7] Joint Monocular 3D Vehicle Detection and Tracking
    Hu, Hou-Ning
    Cai, Qi-Zhi
    Wang, Dequan
    Lin, Ji
    Sun, Min
    Krahenbuhl, Philipp
    Darrell, Trevor
    Yu, Fisher
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5389 - 5398
  • [8] PolarMOT: How Far Can Geometric Relations Take us in 3D Multi-object Tracking?
    Kim, Aleksandr
    Braso, Guillem
    Osep, Aljosa
    Leal-Taixe, Laura
    [J]. COMPUTER VISION, ECCV 2022, PT XXII, 2022, 13682 : 41 - 58
  • [9] Kingma D.P., 2015, P ICLR
  • [10] Lei Ke, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12360), P515, DOI 10.1007/978-3-030-58555-6_31