SemanticAnchors: Sequential Fusion using Lidar Point Cloud and Anchors with Semantic Annotations for 3D Object Detection

被引：1

作者：

Gao, Zhentong ^{[1
]}

Wang, Qiantong ^{[1
]}

Pan, Zongxu ^{[1
]}

Long, Hui ^{[1
]}

Hu, Yuxin ^{[1
]}

Li, Zheng ^{[1
]}

机构：

[1] Chinese Acad Sci, Key Lab Technol Geospatial Informat Proc & Applic, Aerosp Informat Res Inst, Beijing, Peoples R China

来源：

2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA) | 2022年

关键词：

3D object detection; autonomous driving; multi sensor;

D O I：

10.1109/ICIEA54703.2022.10006149

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

3D object detection is an important task in autonomous driving scenario, which is the basis of perception and understanding of 3D scenes. LiDAR and camera are two conunonly used sensors in 3D object detection tasks. However, using a single sensor to collect data will make some objects difficult to detect because both sensors have insurmountable shortcomings Unexpectedly, LiDAR-only detection methods tend to show better performance than the multi -sensor methods in public benchmarks. This shows that people need to further explore the methods of combining the data of the two sensors. Recently, PointPainting has been presented to combine the data of LiDAR and camera more efficiently by attaching the result of image semantic segmentation to point cloud data as new channels. In this paper, we propose an error anchor punishment mechanism based on image semantic segmentation results. After the semantic augmentation of the point cloud data, we judge whether the semantic result of each point is correct by traversing the groundtruth boxes. Further, we assign different weights to each anchor according to the error points contained in each anchor. Experimental results on the KITTI valid set show that SemanticAnchors achieves better performance in both 3D and birds eyes view benchmarks. In particular, our method adds little extra computation and achieves performance improvement in all categories.

引用

页码：1128 / 1133

页数：6

共 16 条

[1] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].

Chen, Liang-Chieh ;

Zhu, Yukun ;

Papandreou, George ;

Schroff, Florian ;

Adam, Hartwig .

COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851

[2] Multi-View 3D Object Detection Network for Autonomous Driving [J].

Chen, Xiaozhi ;

Ma, Huimin ;

Wan, Ji ;

Li, Bo ;

Xia, Tian .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6526-6534

[3] The Cityscapes Dataset for Semantic Urban Scene Understanding [J].

Cordts, Marius ;

Omran, Mohamed ;

Ramos, Sebastian ;

Rehfeld, Timo ;

Enzweiler, Markus ;

Benenson, Rodrigo ;

Franke, Uwe ;

Roth, Stefan ;

Schiele, Bernt .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223

[4]

Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074

[5]

Huang Q, 2017, Arxiv, DOI [arXiv:1707.06426, 10.48550/arxiv.1707.06426]

[6] EPNet: Enhancing Point Features with Image Semantics for 3D Object Detection [J].

Huang, Tengteng ;

Liu, Zhe ;

Chen, Xiwu ;

Bai, Xiang .

COMPUTER VISION - ECCV 2020, PT XV, 2020, 12360 :35-52

[7]

Ku J, 2018, IEEE INT C INT ROBOT, P5750, DOI 10.1109/IROS.2018.8594049

[8] PointPillars: Fast Encoders for Object Detection from Point Clouds [J].

Lang, Alex H. ;

Vora, Sourabh ;

Caesar, Holger ;

Zhou, Lubing ;

Yang, Jiong ;

Beijbom, Oscar .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :12689-12697

[9]

Long J, 2015, PROC CVPR IEEE, P3431, DOI 10.1109/CVPR.2015.7298965

[10] CLOCs: Camera-LiDAR Object Candidates Fusion for 3D Object Detection [J].

Pang, Su ;

Morris, Daniel ;

Radha, Hayder .

2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, :10386-10393

← 1 2 →