Deep multi-scale and multi-modal fusion for 3D object detection

被引:17
|
作者
Guo, Rui [1 ,3 ]
Li, Deng [2 ]
Han, Yahong [2 ]
机构
[1] Southeast Univ, Sch Energy & Environm, Nanjing, Peoples R China
[2] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[3] Southeast Univ, Natl Engn Res Ctr Turbo Generator Vibrat, Nanjing, Peoples R China
关键词
3D Object detection; Feature fusion; Autonomous driving; Point cloud;
D O I
10.1016/j.patrec.2021.08.028
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The perception of 3D objects in the scene is the basis of autonomous driving. Most autonomous driving cars are equipped with cameras and Lidar to obtain 3D spatial information. RGB images taken from the camera and point cloud produced by Lidar both have their own advantages for 3D object detection. In order to make better use of the advantages of image data and point cloud data, a 3D object detection method based on Deep Multi-scale and Multi-modal Fusion (DMMF) is proposed. Firstly, point cloud is projected to the Bird's Eye View (BEV) and extract BEV map and RGB image feature with feature extractor, respectively. Then, fuse the multi-modal feature with the deep multi-scale fusion method and finally input to position regression and classification network for object classification and accurate positioning. The experimental results on the benchmark KITTI dataset show that the method reaches state-of-theart in both car and pedestrian classes, especially for hard level data, the detection AP is significantly improved. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:236 / 242
页数:7
相关论文
共 50 条
  • [1] Multi-Modal and Multi-Scale Fusion 3D Object Detection of 4D Radar and LiDAR for Autonomous Driving
    Wang, Li
    Zhang, Xinyu
    Li, Jun
    Xv, Baowei
    Fu, Rong
    Chen, Haifeng
    Yang, Lei
    Jin, Dafeng
    Zhao, Lijun
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (05) : 5628 - 5641
  • [2] Multi-modal 3D object detection by 2D-guided precision anchor proposal and multi-layer fusion
    Wu, Yi
    Jiang, Xiaoyan
    Fang, Zhijun
    Gao, Yongbin
    Fujita, Hamido
    APPLIED SOFT COMPUTING, 2021, 108
  • [3] Multi-scale multi-modal fusion for object detection in autonomous driving based on selective kernel
    Gao, Xin
    Zhang, Guoying
    Xiong, Yijin
    MEASUREMENT, 2022, 194
  • [4] Improving Deep Multi-modal 3D Object Detection for Autonomous Driving
    Khamsehashari, Razieh
    Schill, Kerstin
    2021 7TH INTERNATIONAL CONFERENCE ON AUTOMATION, ROBOTICS AND APPLICATIONS (ICARA 2021), 2021, : 263 - 267
  • [5] BSM-NET: multi-bandwidth, multi-scale and multi-modal fusion network for 3D object detection of 4D radar and LiDAR
    Jiang, Tiezhen
    Kang, Runjie
    Li, Qingzhu
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (03)
  • [6] MEDMCN: a novel multi-modal EfficientDet with multi-scale CapsNet for object detection
    Li, Xingye
    Liu, Jin
    Tang, Zhengyu
    Han, Bing
    Wu, Zhongdai
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (09) : 12863 - 12890
  • [7] Dual-domain deformable feature fusion for multi-modal 3D object detection
    Wang, Shihao
    Deng, Tao
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (06)
  • [8] Multi-modal information fusion for LiDAR-based 3D object detection framework
    Ma, Ruixin
    Yin, Yong
    Chen, Jing
    Chang, Rihao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 7995 - 8012
  • [9] Deep Multi-modal Object Detection for Autonomous Driving
    Ennajar, Amal
    Khouja, Nadia
    Boutteau, Remi
    Tlili, Fethi
    2021 18TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2021, : 7 - 11
  • [10] Multi-Modal 3D Object Detection in Autonomous Driving: A Survey
    Wang, Yingjie
    Mao, Qiuyu
    Zhu, Hanqi
    Deng, Jiajun
    Zhang, Yu
    Ji, Jianmin
    Li, Houqiang
    Zhang, Yanyong
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (08) : 2122 - 2152