Weakly Supervised Multi-Modal 3D Human Body Pose Estimation for Autonomous Driving

被引:6
作者
Bauer, Peter [1 ]
Bouazizi, Arij [2 ]
Kressel, Ulrich [3 ]
Flohr, Fabian B. [4 ]
机构
[1] Univ Stuttgart, Keplerstr 7, D-70174 Stuttgart, Germany
[2] Friedrich Alexander Univ Erlangen Nuernberg, Cauerstr 7, D-91058 Erlangen, Germany
[3] Univ Ulm, Albert Einstein Allee 41, D-89081 Ulm, Germany
[4] Munich Univ Appl Sci, Intelligent Vehicles Lab, Lothstr 34, D-80335 Munich, Germany
来源
2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV | 2023年
关键词
Autonomous Driving; Human Pose Estimation; Computer Vision; Sensor Fusion;
D O I
10.1109/IV55152.2023.10186575
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate 3D human pose estimation (3D HPE) is crucial for enabling autonomous vehicles (AVs) to make informed decisions and respond proactively in critical road scenarios. Promising results of 3D HPE have been gained in several domains such as human-computer interaction, robotics, sports and medical analytics, often based on data collected in well-controlled laboratory environments. Nevertheless, the transfer of 3D HPE methods to AVs has received limited research attention, due to the challenges posed by obtaining accurate 3D pose annotations and the limited suitability of data from other domains. We present a simple yet efficient weakly supervised approach for 3D HPE in the AV context by employing a high-level sensor fusion between camera and LiDAR data. The weakly supervised setting enables training on the target datasets without any 2D / 3D keypoint labels by using an off-the-shelf 2D joint extractor and pseudo labels generated from LiDAR to image projections. Our approach outperforms state-of-the-art results by up to similar to 13% on the Waymo Open Dataset in the weakly supervised setting and achieves state-of-the-art results in the supervised setting.
引用
收藏
页数:7
相关论文
共 40 条
[21]  
Li W., 2022, IEEE T MULTIMEDIA
[22]   MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation [J].
Li, Wenhao ;
Liu, Hong ;
Tang, Hao ;
Wang, Pichao ;
Van Gool, Luc .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :13137-13146
[23]   Microsoft COCO: Common Objects in Context [J].
Lin, Tsung-Yi ;
Maire, Michael ;
Belongie, Serge ;
Hays, James ;
Perona, Pietro ;
Ramanan, Deva ;
Dollar, Piotr ;
Zitnick, C. Lawrence .
COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755
[24]   Learning Trajectory Dependencies for Human Motion Prediction [J].
Mao, Wei ;
Liu, Miaomiao ;
Salzmann, Mathieu ;
Li, Hongdong .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9488-9496
[25]   A simple yet effective baseline for 3d human pose estimation [J].
Martinez, Julieta ;
Hossain, Rayat ;
Romero, Javier ;
Little, James J. .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2659-2668
[26]   On human motion prediction using recurrent neural networks [J].
Martinez, Julieta ;
Black, Michael J. ;
Romero, Javier .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4674-4683
[27]   3D human pose estimation in video with temporal convolutions and semi-supervised training [J].
Pavllo, Dario ;
Feichtenhofer, Christoph ;
Grangier, David ;
Auli, Michael .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7745-7754
[28]  
Qi CR, 2017, ADV NEUR IN, V30
[29]   Scalability in Perception for Autonomous Driving: Waymo Open Dataset [J].
Sun, Pei ;
Kretzschmar, Henrik ;
Dotiwalla, Xerxes ;
Chouard, Aurelien ;
Patnaik, Vijaysai ;
Tsui, Paul ;
Guo, James ;
Zhou, Yin ;
Chai, Yuning ;
Caine, Benjamin ;
Vasudevan, Vijay ;
Han, Wei ;
Ngiam, Jiquan ;
Zhao, Hang ;
Timofeev, Aleksei ;
Ettinger, Scott ;
Krivokon, Maxim ;
Gao, Amy ;
Joshi, Aditya ;
Zhang, Yu ;
Shlens, Jonathon ;
Chen, Zhifeng ;
Anguelov, Dragomir .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :2443-2451
[30]  
Wang Jianhong, 2021, Advances in Neural Information Processing Systems, V34