DEEP SENSOR FUSION BASED ON FRUSTUM POINT SINGLE SHOT MULTIBOX DETECTOR FOR 3D OBJECT DETECTION

Cited by: 0

Authors:

Wang, Yu [1]

Zhang, Ye [1]

Zhai, Shaohua [1]

Chen, Hao [1]

Shi, Shaoqi [1]

Wang, Gang [2]

Affiliations:

[1] Harbin Inst Technol, Dept Informat Engn, Harbin, Peoples R China

[2] Alibaba Grp, Hangzhou, Peoples R China

Source:

2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2021

Funding:

National Natural Science Foundation of China;

Keywords:

Semantic segmentation; frustum point cloud; attention mechanism; sensor fusion; object detection;

DOI:

10.1109/ICIP42928.2021.9506167

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We present a deep sensor fusion method based on a frustum point single shot multibox detector (PointSSD) for autonomous driving scenarios. The proposed method addresses the precision degradation in frustum PointNets (F-PointNet) caused by relying heavily on 2D detection and making insufficient use of RGB information. The method consists mainly of two subnetworks: a pyramid segmentation network (PSNet) and PointSSD. PSNet uses a novel architecture that performs semantic segmentation on RGB information to generate high-quality image semantic information. Using this image semantic information, point cloud semantic information is obtained through projection and is then fused with raw 3D spatial features by deep fusion. The fusion results are processed by PointSSD, which performs classification and bounding box regression. Evaluated on the KITTI dataset, our method is superior to other methods in 3D classification and 3D localization. In addition, our method remains robust to 2D false detections.
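The projection step mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `fuse_points_with_semantics`, the argument names, the assumption that points are already expressed in camera coordinates, and the use of plain feature concatenation in place of the paper's deep fusion are all illustrative assumptions.

```python
import numpy as np

def fuse_points_with_semantics(points_xyz, seg_scores, P):
    """Attach per-pixel semantic scores to 3D points (hypothetical helper).

    points_xyz: (N, 3) points, assumed to be in camera coordinates.
    seg_scores: (H, W, C) per-pixel class scores from an image segmenter.
    P:          (3, 4) camera projection matrix.
    Returns an (M, 3 + C) array for the M points that have positive depth
    and project inside the image.
    """
    H, W, _ = seg_scores.shape

    # Project homogeneous points onto the image plane.
    ones = np.ones((points_xyz.shape[0], 1))
    uvw = np.hstack([points_xyz, ones]) @ P.T  # (N, 3)

    # Keep only points in front of the camera.
    in_front = uvw[:, 2] > 1e-6
    uvw = uvw[in_front]
    pts = points_xyz[in_front]

    # Perspective divide, then snap to integer pixel indices.
    u = np.floor(uvw[:, 0] / uvw[:, 2]).astype(int)
    v = np.floor(uvw[:, 1] / uvw[:, 2]).astype(int)

    # Discard points that fall outside the image bounds.
    inside = (u >= 0) & (u < W) & (v >= 0) & (v < H)

    # Look up semantic scores (row = v, column = u) and concatenate
    # them with the raw coordinates. Concatenation is a stand-in here,
    # not the fusion scheme described in the paper.
    sem = seg_scores[v[inside], u[inside]]  # (M, C)
    return np.hstack([pts[inside], sem])
```

Under these assumptions, each surviving point carries its 3D coordinates followed by the class scores of the pixel it projects onto, which is the kind of per-point input a downstream detection network could consume.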


Pages: 674-678

Number of pages: 5
