LIGA-Stereo: Learning LiDAR Geometry Aware Representations for Stereo-based 3D Detector

Cited by: 74
Authors
Guo, Xiaoyang [1]
Shi, Shaoshuai [1]
Wang, Xiaogang [1]
Li, Hongsheng [1]
Affiliations
[1] Chinese Univ Hong Kong, CUHK SenseTime Joint Lab, Hong Kong, Peoples R China
Source
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021
DOI
10.1109/ICCV48922.2021.00314
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Stereo-based 3D detection aims at detecting 3D objects from stereo images, which provides a low-cost solution for 3D perception. However, its performance is still inferior to that of LiDAR-based detection algorithms. To detect and localize accurate 3D bounding boxes, LiDAR-based detectors encode high-level representations from LiDAR point clouds, such as accurate object boundaries and surface normal directions. In contrast, high-level features learned by stereo-based detectors are easily affected by erroneous depth estimation due to the limitations of stereo matching. To solve this problem, we propose LIGA-Stereo (LiDAR Geometry Aware Stereo Detector) to learn stereo-based 3D detectors under the guidance of high-level geometry-aware representations of LiDAR-based detection models. In addition, we found that existing voxel-based stereo detectors fail to learn semantic features effectively from indirect 3D supervision. We attach an auxiliary 2D detection head to provide direct 2D semantic supervision. Experimental results show that these two strategies improve the geometric and semantic representation capabilities. Compared with the state-of-the-art stereo detector, our method improves the 3D detection performance for cars, pedestrians, and cyclists by 10.44%, 5.69%, and 5.97% mAP, respectively, on the official KITTI benchmark. The gap between stereo-based and LiDAR-based 3D detectors is further narrowed. The code is available at https://xy-guo.github.io/liga/.
Pages: 3133-3143
Page count: 11