LMIINet: long-range and multi-scale information interaction network for 3D object detection

被引:0
|
作者
Mai, Chengfeng [1 ]
Wang, Haosen [2 ]
Wang, Cui [3 ]
Zhang, Bo [1 ]
Kodagoda, Sarath [4 ]
Wang, Shifeng [1 ,3 ]
机构
[1] Changchun Univ Sci & Technol, Sch Optoelect Engn, Changchun, Peoples R China
[2] Sun Yat Sen Univ, Sch Artificial Intelligence, Guangzhou, Peoples R China
[3] Changchun Univ Sci & Technol, Zhongshan Inst, Zhongshan, Peoples R China
[4] Univ Technol Sydney, Sydney, NSW, Australia
关键词
3D object detection; LiDAR; spatial feature pyramid; multi-scale information interaction; RCIoU loss;
D O I
10.1117/1.JEI.33.6.063034
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
LiDAR-based 3D object detection is crucial for perception systems in autonomous driving. However, current methods perform poorly in detecting occluded and small objects due to the sparse and uneven distribution of point clouds in 3D scenes. To address this problem, we propose a long-range and multi-scale information interaction network (LMIINet) for 3D object detection. First, a feature extraction backbone with a spatial feature pyramid block is designed to effectively capture long-range dependencies between features. Then, multi-scale spatial features from the 3D backbone are adaptively fused at the neck to aggregate local and global contextual information. Finally, by fully utilizing the correlations between bounding box parameters, the proposed rotation-decoupled corrected intersection-over-union (RCIoU) loss is employed for the classification and regression supervision of bounding boxes to improve the one-stage point cloud detectors. LMIINet achieves competitive performance on small objects (pedestrians and cyclists) detection in the KITTI dataset. Compared with the benchmark network SECOND, LMIINet increases mAP3D/mAPBEV by 1.98%/2.09%, 12.25%/11.14%, and 6.44%/6.45% for the car, pedestrian, and cyclist classes, respectively. (c) 2024 SPIE and IS&T
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Multi-Scale PointPillars 3D Object Detection Network
    Ya, Hang
    Luo, Guiming
    PROCEEDINGS OF THE 2019 IEEE 18TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC 2019), 2019, : 174 - 179
  • [2] Multi-scale attention and boundary enhancement with long-range dependency for salient object detection
    Yu, Ming
    Lin, Xiaoqing
    Liu, Yi
    Guo, Yingchun
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (06) : 8957 - 8969
  • [3] Towards Long-Range 3D Object Detection for Autonomous Vehicles
    Khoche, Ajinkya
    Sanchez, Laura Pereira
    Batool, Nazre
    Mansouri, Sina Sharif
    Jensfelt, Patric
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 2206 - 2212
  • [4] Multi-Scale Spatial Transformer Network for LiDAR-Camera 3D Object Detection
    Wang, Zhifan
    Zhang, Xiaohong
    Wang, Shidong
    Xin, Tong
    Zhang, Haofeng
    Lu, Jianfeng
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [5] 3D Object Detection Algorithm for Panoramic Images With Multi-Scale Convolutional Neural Network
    Wang, Dianwei
    He, Yanhui
    Liu, Ying
    Li, Daxiang
    Wu, Shiqian
    Qin, Yongrui
    Xu, Zhijie
    IEEE ACCESS, 2019, 7 : 171461 - 171470
  • [6] Spatial Information Enhancement with Multi-Scale Feature Aggregation for Long-Range Object and Small Reflective Area Object Detection from Point Cloud
    Li, Hanwen
    Tao, Huamin
    Deng, Qiuqun
    Xiao, Shanzhu
    Zhou, Jianxiong
    REMOTE SENSING, 2024, 16 (14)
  • [7] Joint multi-scale information and long-range dependence for video captioning
    Zhai, Zhongyi
    Chen, Xiaofeng
    Huang, Yishuang
    Zhao, Lingzhong
    Cheng, Bo
    He, Qian
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2023, 12 (02)
  • [8] Joint multi-scale information and long-range dependence for video captioning
    Zhongyi Zhai
    Xiaofeng Chen
    Yishuang Huang
    Lingzhong Zhao
    Bo Cheng
    Qian He
    International Journal of Multimedia Information Retrieval, 2023, 12
  • [9] Deep multi-scale and multi-modal fusion for 3D object detection
    Guo, Rui
    Li, Deng
    Han, Yahong
    PATTERN RECOGNITION LETTERS, 2021, 151 : 236 - 242
  • [10] Multi-Scale Keypoints Feature Fusion Network for 3D Object Detection from Point Clouds
    Zhang, Xu
    Bai, Linjuan
    Zhang, Zuyu
    Li, Yan
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2022, 12