ACF-Net: Asymmetric Cascade Fusion for 3D Detection With LiDAR Point Clouds and Images

被引:8
|
作者
Tian, Yonglin [1 ,2 ]
Zhang, Xianjing [2 ]
Wang, Xiao [3 ]
Xu, Jintao [2 ]
Wang, Jiangong [1 ]
Ai, Rui [4 ]
Gu, Weihao [4 ]
Ding, Weiping [5 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] Haomo Technol Co Ltd, AI Ctr, Beijing 100192, Peoples R China
[3] Anhui Univ, Sch Artificial Intelligence, Hefei 230031, Peoples R China
[4] Haomo Technol Co Ltd, Beijing 100192, Peoples R China
[5] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
来源
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2024年 / 9卷 / 02期
关键词
Three-dimensional displays; Feature extraction; Point cloud compression; Laser radar; Object detection; Timing; Fuses; 3D detection; autonomous driving; asymmetric fusion; cascade fusion; multimodal fusion; OBJECT; PERFORMANCE;
D O I
10.1109/TIV.2023.3341223
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recognition and utilization of complementary information arising from modality-intrinsic properties play crucial roles in multimodal 3D detection. However, most of the current approaches for fusion-based 3D detection follow symmetrical fusion paradigms and adopt early fusion, middle fusion as well as late fusion styles, which ignore the unequal status of data with different modalities. In this paper, according to the timing of fusion, we adopt an asymmetric cascade fusion network to exploit both the structural information from point clouds and the complementary semantic information from images. A multi-stage cascade design of 3D object detection is proposed to iteratively refine predictions and several late image features (comprised of detection clues, segmentation clues, and deep features from encoders) are incorporated into different stages of the LiDAR branch to maintain the integrity of image features and enable deep multimodal interactions. Besides, to mitigate the effects of the down-sampling of voxelized features and possible mismatching of multimodal data, we propose proxy-based cross-modality sampling to utilize the high-density point clouds coordinates and develop an image degeneration process to simulate the noise in cross-modality matching for robust training. Extensive experiments are conducted on KITTI and Waymo Open Dataset, which validate the effectiveness of the proposed method.
引用
收藏
页码:3360 / 3371
页数:12
相关论文
共 50 条
  • [1] 3D Vehicle Detection Using Multi-Level Fusion From Point Clouds and Images
    Zhao, Kun
    Ma, Lingfei
    Meng, Yu
    Liu, Li
    Wang, Junbo
    Marcato, Jose, Jr.
    Goncalves, Wesley Nunes
    Li, Jonathan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (09) : 15146 - 15154
  • [2] STFNET: Sparse Temporal Fusion for 3D Object Detection in LiDAR Point Cloud
    Meng, Xin
    Zhou, Yuan
    Ma, Jun
    Jiang, Fangdi
    Qi, Yongze
    Wang, Cui
    Kim, Jonghyuk
    Wang, Shifeng
    IEEE SENSORS JOURNAL, 2025, 25 (03) : 5866 - 5877
  • [3] SIEV-Net: A Structure-Information Enhanced Voxel Network for 3D Object Detection From LiDAR Point Clouds
    Yu, Chuanbo
    Lei, Jianjun
    Peng, Bo
    Shen, Haifeng
    Huang, Qingming
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [4] LiDAR-Camera Fusion in Perspective View for 3D Object Detection in Surface Mine
    Ai, Yunfeng
    Yang, Xue
    Song, Ruiqi
    Cui, Chenglin
    Li, Xinqing
    Cheng, Qi
    Tian, Bin
    Chen, Long
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (02): : 3721 - 3730
  • [5] CL3D: Camera-LiDAR 3D Object Detection With Point Feature Enhancement and Point-Guided Fusion
    Lin, Chunmian
    Tian, Daxin
    Duan, Xuting
    Zhou, Jianshan
    Zhao, Dezong
    Cao, Dongpu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 18040 - 18050
  • [6] 3D Cascade RCNN: High Quality Object Detection in Point Clouds
    Cai, Qi
    Pan, Yingwei
    Yao, Ting
    Mei, Tao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 5706 - 5719
  • [7] DA-Net: Density-Aware 3D Object Detection Network for Point Clouds
    Wang, Shuhua
    Lu, Ke
    Xue, Jian
    Zhao, Yang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 665 - 678
  • [8] MSL-Net: Sharp Feature Detection Network for 3D Point Clouds
    Jiao, Xianhe
    Lv, Chenlei
    Yi, Ran
    Zhao, Junli
    Pan, Zhenkuan
    Wu, Zhongke
    Liu, Yong-Jin
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (09) : 6433 - 6446
  • [9] A Novel Interactive Fusion Method with Images and Point Clouds for 3D Object Detection
    Xu, Kai
    Yang, Zhile
    Xu, Yangjie
    Feng, Liangbing
    APPLIED SCIENCES-BASEL, 2019, 9 (06):
  • [10] EDT-Net: A Lightweight Tunnel Water Leakage Detection Network Based on LiDAR Point Clouds Intensity Images
    Liu, Zhenyu
    Gao, Xianjun
    Yang, Yuanwei
    Xu, Lei
    Wang, Shaoning
    Chen, Ningsheng
    Wang, Zhiwei
    Kou, Yuan
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 7334 - 7346