Deep multi-scale and multi-modal fusion for 3D object detection

被引：17

作者：

Guo, Rui ^{[1
,3
]}

Li, Deng ^{[2
]}

Han, Yahong ^{[2
]}

机构：

[1] Southeast Univ, Sch Energy & Environm, Nanjing, Peoples R China

[2] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China

[3] Southeast Univ, Natl Engn Res Ctr Turbo Generator Vibrat, Nanjing, Peoples R China

来源：

PATTERN RECOGNITION LETTERS | 2021年 / 151卷

关键词：

3D Object detection; Feature fusion; Autonomous driving; Point cloud;

D O I：

10.1016/j.patrec.2021.08.028

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The perception of 3D objects in the scene is the basis of autonomous driving. Most autonomous driving cars are equipped with cameras and Lidar to obtain 3D spatial information. RGB images taken from the camera and point cloud produced by Lidar both have their own advantages for 3D object detection. In order to make better use of the advantages of image data and point cloud data, a 3D object detection method based on Deep Multi-scale and Multi-modal Fusion (DMMF) is proposed. Firstly, point cloud is projected to the Bird's Eye View (BEV) and extract BEV map and RGB image feature with feature extractor, respectively. Then, fuse the multi-modal feature with the deep multi-scale fusion method and finally input to position regression and classification network for object classification and accurate positioning. The experimental results on the benchmark KITTI dataset show that the method reaches state-of-theart in both car and pedestrian classes, especially for hard level data, the detection AP is significantly improved. (c) 2021 Elsevier B.V. All rights reserved.

引用

页码：236 / 242

页数：7

共 50 条

[1] Multi-Modal and Multi-Scale Fusion 3D Object Detection of 4D Radar and LiDAR for Autonomous Driving
Wang, Li
Zhang, Xinyu
Li, Jun
Xv, Baowei
Fu, Rong
Chen, Haifeng
Yang, Lei
Jin, Dafeng
Zhao, Lijun
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (05) : 5628 - 5641
[2] Multi-modal 3D object detection by 2D-guided precision anchor proposal and multi-layer fusion
Wu, Yi
Jiang, Xiaoyan
Fang, Zhijun
Gao, Yongbin
Fujita, Hamido
APPLIED SOFT COMPUTING, 2021, 108
[3] Multi-scale multi-modal fusion for object detection in autonomous driving based on selective kernel
Gao, Xin
Zhang, Guoying
Xiong, Yijin
MEASUREMENT, 2022, 194
[4] Improving Deep Multi-modal 3D Object Detection for Autonomous Driving
Khamsehashari, Razieh
Schill, Kerstin
2021 7TH INTERNATIONAL CONFERENCE ON AUTOMATION, ROBOTICS AND APPLICATIONS (ICARA 2021), 2021, : 263 - 267
[5] BSM-NET: multi-bandwidth, multi-scale and multi-modal fusion network for 3D object detection of 4D radar and LiDAR
Jiang, Tiezhen
Kang, Runjie
Li, Qingzhu
MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (03)
[6] MEDMCN: a novel multi-modal EfficientDet with multi-scale CapsNet for object detection
Li, Xingye
Liu, Jin
Tang, Zhengyu
Han, Bing
Wu, Zhongdai
JOURNAL OF SUPERCOMPUTING, 2024, 80 (09) : 12863 - 12890
[7] Dual-domain deformable feature fusion for multi-modal 3D object detection
Wang, Shihao
Deng, Tao
JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (06)
[8] Multi-modal information fusion for LiDAR-based 3D object detection framework
Ma, Ruixin
Yin, Yong
Chen, Jing
Chang, Rihao
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 7995 - 8012
[9] Deep Multi-modal Object Detection for Autonomous Driving
Ennajar, Amal
Khouja, Nadia
Boutteau, Remi
Tlili, Fethi
2021 18TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2021, : 7 - 11
[10] Multi-Modal 3D Object Detection in Autonomous Driving: A Survey
Wang, Yingjie
Mao, Qiuyu
Zhu, Hanqi
Deng, Jiajun
Zhang, Yu
Ji, Jianmin
Li, Houqiang
Zhang, Yanyong
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (08) : 2122 - 2152

← 1 2 3 4 5 →