EPNet: Enhancing Point Features with Image Semantics for 3D Object Detection

被引:296
作者
Huang, Tengteng [1 ]
Liu, Zhe [1 ]
Chen, Xiwu [1 ]
Bai, Xiang [1 ]
机构
[1] Huazhong Univ Sci & Technol, Wuhan, Peoples R China
来源
COMPUTER VISION - ECCV 2020, PT XV | 2020年 / 12360卷
基金
国家重点研发计划;
关键词
3D object detection; Point cloud; Multiple sensors;
D O I
10.1007/978-3-030-58555-6_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we aim at addressing two critical issues in the 3D detection task, including the exploitation of multiple sensors (namely LiDAR point cloud and camera image), as well as the inconsistency between the localization and classification confidence. To this end, we propose a novel fusion module to enhance the point features with semantic image features in a point-wise manner without any image annotations. Besides, a consistency enforcing loss is employed to explicitly encourage the consistency of both the localization and classification confidence. We design an end-to-end learnable framework named EPNet to integrate these two components. Extensive experiments on the KITTI and SUN-RGBD datasets demonstrate the superiority of EPNet over the state-of-the-art methods. Codes and models are available at: https://github.com/happinesslz/EPNet.
引用
收藏
页码:35 / 52
页数:18
相关论文
共 43 条
[1]   3D Object Proposals Using Stereo Imagery for Accurate Object Class Detection [J].
Chen, Xiaozhi ;
Kundu, Kaustav ;
Zhu, Yukun ;
Ma, Huimin ;
Fidler, Sanja ;
Urtasun, Raquel .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (05) :1259-1272
[2]   Multi-View 3D Object Detection Network for Autonomous Driving [J].
Chen, Xiaozhi ;
Ma, Huimin ;
Wan, Ji ;
Li, Bo ;
Xia, Tian .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6526-6534
[3]   Monocular 3D Object Detection for Autonomous Driving [J].
Chen, Xiaozhi ;
Kundu, Kaustav ;
Zhang, Ziyu ;
Ma, Huimin ;
Fidler, Sanja ;
Urtasun, Raquel .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2147-2156
[4]   SO-HandNet: Self-Organizing Network for 3D Hand Pose Estimation with Semi-supervised Learning [J].
Chen, Yujin ;
Tu, Zhigang ;
Ge, Liuhao ;
Zhang, Dejun ;
Chen, Ruizhi ;
Yuan, Junsong .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6960-6969
[5]  
Du XX, 2018, IEEE INT CONF ROBOT, P3194
[6]  
Geiger A., P IEEE INT C COMP VI
[7]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[8]  
Ioffe Sergey, 2015, Proceedings of Machine Learning Research, V37, P448, DOI DOI 10.48550/ARXIV.1502.03167
[9]   Acquisition of Localization Confidence for Accurate Object Detection [J].
Jiang, Borui ;
Luo, Ruixuan ;
Mao, Jiayuan ;
Xiao, Tete ;
Jiang, Yuning .
COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :816-832
[10]  
King DB, 2015, ACS SYM SER, V1214, P1, DOI 10.1021/bk-2015-1214.ch001