Deep End-to-end 3D Person Detection from Camera and Lidar

被引:0
|
作者
Roth, Markus [1 ,2 ]
Jargot, Dominik [2 ]
Gavrila, Dariu M. [2 ]
机构
[1] Daimler AG, Environm Percept, Stuttgart, Germany
[2] Delft Univ Technol, Intelligent Vehicles, Delft, Netherlands
关键词
D O I
10.1109/itsc.2019.8917366
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
We present a method for 3D person detection from camera images and lidar point clouds in automotive scenes. The method comprises a deep neural network which estimates the 3D location and extent of persons present in the scene. 3D anchor proposals are refined in two stages: a region proposal network and a subsequent detection network. For both input modalities high-level feature representations are learned from raw sensor data instead of being manually designed. To that end, we use Voxel Feature Encoders [1] to obtain point cloud features instead of widely used projection-based point cloud representations, thus allowing the network to learn to predict the location and extent of persons in an end-to-end manner. Experiments on the validation set of the KITTI 3D object detection benchmark [2] show that the proposed method outperforms state-of-the-art methods with an average precision (AP) of 47.06% on moderate difficulty.
引用
收藏
页码:521 / 527
页数:7
相关论文
共 50 条
  • [1] End-to-End 3D Object Detection using LiDAR Point Cloud
    Raut, Gaurav
    Patole, Advait
    2024 IEEE 3RD INTERNATIONAL CONFERENCE ON COMPUTING AND MACHINE INTELLIGENCE, ICMI 2024, 2024,
  • [2] RGBDTCalibNet: End-to-end Online Extrinsic Calibration between a 3D LiDAR, an RGB Camera and a Thermal Camera
    Mharolkar, Sanat
    Zhang, Jun
    Peng, Guohao
    Liu, Yiyao
    Wang, Danwei
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 3577 - 3582
  • [3] End-to-End Optimization of LiDAR Beam Configuration for 3D Object Detection and Localization
    Voedisch, Niclas
    Unal, Ozan
    Li, Ke
    Van Gool, Luc
    Dai, Dengxin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 2242 - 2249
  • [4] End-to-end Multiplayer Violence Detection based on Deep 3D CNN
    Li, Chengyang
    Zhu, Liping
    Zhu, Dandan
    Chen, Jiale
    Pan, Zhanghui
    Li, Xue
    Wang, Bing
    PROCEEDINGS OF 2018 VII INTERNATIONAL CONFERENCE ON NETWORK, COMMUNICATION AND COMPUTING (ICNCC 2018), 2018, : 227 - 230
  • [5] End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection
    Qian, Rui
    Garg, Divyansh
    Wang, Yan
    You, Yurong
    Belongie, Serge
    Hariharan, Bharath
    Campbell, Mark
    Weinberger, Kilian Q.
    Chao, Wei-Lun
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5880 - 5889
  • [6] LiDAR-as-Camera for End-to-End Driving
    Tampuu, Ardi
    Aidla, Romet
    van Gent, Jan Aare
    Matiisen, Tambet
    SENSORS, 2023, 23 (05)
  • [7] End-to-End Multi-View Fusion for 3D Object Detection in LiDAR Point Clouds
    Zhou, Yin
    Sun, Pei
    Zhang, Yu
    Anguelov, Dragomir
    Gao, Jiyang
    Ouyang, Tom
    Guo, James
    Ngiam, Jiquan
    Vasudevan, Vijay
    CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
  • [8] BirdNet plus : End-to-End 3D Object Detection in LiDAR Bird's Eye View
    Barrera, Alejandro
    Guindel, Carlos
    Beltran, Jorge
    Garcia, Fernando
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [9] SparseDet: Towards End-to-End 3D Object Detection
    Han, Jianhong
    Wan, Zhaoyi
    Liu, Zhe
    Feng, Jie
    Zhou, Bingfeng
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 4, 2022, : 781 - 792
  • [10] An End-to-End Transformer Model for 3D Object Detection
    Misra, Ishan
    Girdhar, Rohit
    Joulin, Armand
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2886 - 2897