Weakly Supervised Multi-Modal 3D Human Body Pose Estimation for Autonomous Driving

被引:6
作者
Bauer, Peter [1 ]
Bouazizi, Arij [2 ]
Kressel, Ulrich [3 ]
Flohr, Fabian B. [4 ]
机构
[1] Univ Stuttgart, Keplerstr 7, D-70174 Stuttgart, Germany
[2] Friedrich Alexander Univ Erlangen Nuernberg, Cauerstr 7, D-91058 Erlangen, Germany
[3] Univ Ulm, Albert Einstein Allee 41, D-89081 Ulm, Germany
[4] Munich Univ Appl Sci, Intelligent Vehicles Lab, Lothstr 34, D-80335 Munich, Germany
来源
2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV | 2023年
关键词
Autonomous Driving; Human Pose Estimation; Computer Vision; Sensor Fusion;
D O I
10.1109/IV55152.2023.10186575
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate 3D human pose estimation (3D HPE) is crucial for enabling autonomous vehicles (AVs) to make informed decisions and respond proactively in critical road scenarios. Promising results of 3D HPE have been gained in several domains such as human-computer interaction, robotics, sports and medical analytics, often based on data collected in well-controlled laboratory environments. Nevertheless, the transfer of 3D HPE methods to AVs has received limited research attention, due to the challenges posed by obtaining accurate 3D pose annotations and the limited suitability of data from other domains. We present a simple yet efficient weakly supervised approach for 3D HPE in the AV context by employing a high-level sensor fusion between camera and LiDAR data. The weakly supervised setting enables training on the target datasets without any 2D / 3D keypoint labels by using an off-the-shelf 2D joint extractor and pseudo labels generated from LiDAR to image projections. Our approach outperforms state-of-the-art results by up to similar to 13% on the Waymo Open Dataset in the weakly supervised setting and achieves state-of-the-art results in the supervised setting.
引用
收藏
页数:7
相关论文
共 40 条
[1]  
Bouazizi A., 2021, 2021 17 IEEE INT C A, P1
[2]  
Bouazizi A, 2022, PROCEEDINGS OF THE THIRTY-FIRST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2022, P791
[3]   Self-Supervised 3D Human Pose Estimation with Multiple-View Geometry [J].
Bouazizi, Arij ;
Wiederer, Julian ;
Kressel, Ulrich ;
Belagiannis, Vasileios .
2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
[4]   Simple Pair Pose - Pairwise Human Pose Estimation in Dense Urban Traffic Scenes [J].
Braun, Markus ;
Flohr, Fabian B. ;
Krebs, Sebastian ;
Kressel, Ulrich ;
Gavrila, Dariu M. .
2021 32ND IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2021, :1545-1552
[5]  
Braun M, 2020, IEEE INT VEH SYM, P1694, DOI 10.1109/IV47402.2020.9304557
[6]   EuroCity Persons: A Novel Benchmark for Person Detection in Traffic Scenes [J].
Braun, Markus ;
Krebs, Sebastian ;
Flohr, Fabian ;
Gavrila, Dariu M. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (08) :1844-1861
[7]  
Caesar H, 2020, PROC CVPR IEEE, P11618, DOI 10.1109/CVPR42600.2020.01164
[8]  
Cao Z, 2020, Img Proc Comp Vis Re, V12346, P387, DOI 10.1007/978-3-030-58452-8_23
[9]   Optimizing Network Structure for 3D Human Pose Estimation [J].
Ci, Hai ;
Wang, Chunyu ;
Ma, Xiaoxuan ;
Wang, Yizhou .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :2262-2271
[10]  
Czech P., 2022, arXiv