Deep End-to-end 3D Person Detection from Camera and Lidar

被引:0
|
作者
Roth, Markus [1 ,2 ]
Jargot, Dominik [2 ]
Gavrila, Dariu M. [2 ]
机构
[1] Daimler AG, Environm Percept, Stuttgart, Germany
[2] Delft Univ Technol, Intelligent Vehicles, Delft, Netherlands
关键词
D O I
10.1109/itsc.2019.8917366
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
We present a method for 3D person detection from camera images and lidar point clouds in automotive scenes. The method comprises a deep neural network which estimates the 3D location and extent of persons present in the scene. 3D anchor proposals are refined in two stages: a region proposal network and a subsequent detection network. For both input modalities high-level feature representations are learned from raw sensor data instead of being manually designed. To that end, we use Voxel Feature Encoders [1] to obtain point cloud features instead of widely used projection-based point cloud representations, thus allowing the network to learn to predict the location and extent of persons in an end-to-end manner. Experiments on the validation set of the KITTI 3D object detection benchmark [2] show that the proposed method outperforms state-of-the-art methods with an average precision (AP) of 47.06% on moderate difficulty.
引用
收藏
页码:521 / 527
页数:7
相关论文
共 50 条
  • [21] TesseTrack: End-to-End Learnable Multi-Person Articulated 3D Pose Tracking
    Reddy, N. Dinesh
    Guigues, Laurent
    Pishchulin, Leonid
    Eledath, Jayan
    Narasimhan, Srinivasa G.
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15185 - 15195
  • [22] End-to-end Learning for 3D Facial Animation from Speech
    Pham, Hai X.
    Wang, Yuting
    Pavlovic, Vladimir
    ICMI'18: PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2018, : 361 - 365
  • [23] An end-to-end workflow for nondestructive 3D pathology
    Bishop, Kevin W.
    Erion Barner, Lindsey A.
    Han, Qinghua
    Baraznenok, Elena
    Lan, Lydia
    Poudel, Chetan
    Gao, Gan
    Serafin, Robert B.
    Chow, Sarah S. L.
    Glaser, Adam K.
    Janowczyk, Andrew
    Brenes, David
    Huang, Hongyi
    Miyasato, Dominie
    True, Lawrence D.
    Kang, Soyoung
    Vaughan, Joshua C.
    Liu, Jonathan T. C.
    NATURE PROTOCOLS, 2024, 19 (04) : 1122 - 1148
  • [24] KeypointDETR: An End-to-End 3D Keypoint Detector
    Jin, Hairong
    Shen, Yuefan
    Lou, Jianwen
    Zhou, Kun
    Zheng, Youyi
    COMPUTER VISION - ECCV 2024, PT LXXIV, 2025, 15132 : 374 - 390
  • [25] End-to-end 3D Tracking with Decoupled Queries
    Li, Yanwei
    Yu, Zhiding
    Philion, Jonah
    Anandkumar, Anima
    Fidler, Sanja
    Jia, Jiaya
    Alvarez, Jose
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18256 - 18265
  • [26] 3D Dosimetry in End-To-End Dosimetry QA
    Ibbott, G.
    MEDICAL PHYSICS, 2016, 43 (06) : 3695 - 3695
  • [27] An end-to-end workflow for nondestructive 3D pathology
    Kevin W. Bishop
    Lindsey A. Erion Barner
    Qinghua Han
    Elena Baraznenok
    Lydia Lan
    Chetan Poudel
    Gan Gao
    Robert B. Serafin
    Sarah S. L. Chow
    Adam K. Glaser
    Andrew Janowczyk
    David Brenes
    Hongyi Huang
    Dominie Miyasato
    Lawrence D. True
    Soyoung Kang
    Joshua C. Vaughan
    Jonathan T. C. Liu
    Nature Protocols, 2024, 19 : 1122 - 1148
  • [28] Camera and LiDAR Fusion for Robust 3D Person Detection in Indoor Environments
    Silva, Carlos A.
    Dogru, Sedat
    Marques, Lino
    2023 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS, ICARSC, 2023, : 187 - 192
  • [29] VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection
    Zhou, Yin
    Tuzel, Oncel
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4490 - 4499
  • [30] BLPNet: An End-to-End Model Towards Voxelization Free 3D Object Detection
    Cui, Zhihao
    Zhang, Zhenhua
    2020 JOINT 9TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2020 4TH INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR), 2020,