Equal Emphasis on Data and Network: A Two-Stage 3D Point Cloud Object Detection Algorithm with Feature Alignment

被引:1
|
作者
Xiao, Kai [1 ]
Li, Teng [2 ,3 ]
Li, Jun [4 ]
Huang, Da [1 ]
Peng, Yuanxi [1 ]
机构
[1] Natl Univ Def Technol, Coll Comp Sci & Technol, State Key Lab High Performance Comp, Changsha 410073, Peoples R China
[2] Natl Univ Def Technol, Beijing Inst Adv Study, Beijing 100020, Peoples R China
[3] Natl Univ Def Technol, Coll Adv Interdisciplinary Studies, Changsha 410073, Peoples R China
[4] Natl Univ Def Technol, Coll Elect Sci, Changsha 410073, Peoples R China
基金
中国国家自然科学基金;
关键词
3D object detection; data imbalance; feature alignment; feature fusion; two-stage detection network; MULTIVIEW;
D O I
10.3390/rs16020249
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Three-dimensional object detection is a pivotal research topic in computer vision, aiming to identify and locate objects in three-dimensional space. It has wide applications in various fields such as geoscience, autonomous driving, and drone navigation. The rapid development of deep learning techniques has led to significant advancements in 3D object detection. However, with the increasing complexity of applications, 3D object detection faces a series of challenges such as data imbalance and the effectiveness of network models. Specifically, in an experiment, our investigation revealed a notable discrepancy in the LiDAR reflection intensity within a point cloud scene, with stronger intensities observed in proximity and weaker intensities observed at a distance. Furthermore, we have also noted a substantial disparity in the number of foreground points compared to the number of background points. Especially in 3D object detection, the foreground point is more important than the background point, but it is usually downsampled without discrimination in the subsequent processing. With the objective of tackling these challenges, we work from both data and network perspectives, designing a feature alignment filtering algorithm and a two-stage 3D object detection network. Firstly, in order to achieve feature alignment, we introduce a correction equation to decouple the relationship between distance and intensity and eliminate the attenuation effect of intensity caused by distance. Then, a background point filtering algorithm is designed by using the aligned data to alleviate the problem of data imbalance. At the same time, we take into consideration the fact that the accuracy of semantic segmentation plays a crucial role in 3D object detection. Therefore, we propose a two-stage deep learning network that integrates spatial and spectral information, in which a feature fusion branch is designed and embedded in the semantic segmentation backbone. Through a series of experiments on the KITTI dataset, it is proven that the proposed method achieves the following average precision (AP_R40) values for easy, moderate, and hard difficulties, respectively: car (Iou 0.7)-89.23%, 80.14%, and 77.89%; pedestrian (Iou 0.5)-52.32%, 45.47%, and 38.78%; and cyclist (Iou 0.5)-76.41%, 61.92%, and 56.39%. By emphasizing both data quality optimization and efficient network architecture, the performance of the proposed method is made comparable to other state-of-the-art methods.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] PPMGNet: A Neural Network Algorithm for Point Cloud 3D Object Detection
    Peng, Xiaohong
    Wang, Sen
    Geng, Shuqin
    Zhang, Zhe
    Tang, Haonan
    Wang, Yu
    Wang, Jie
    Li, Xuefeng
    Du, Jianing
    2020 IEEE 14TH INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (ASID), 2020, : 53 - 56
  • [2] Pyramid-feature-fusion-based Two-stage Vehicle Detection via 3D Point Cloud
    Zhang M.-F.
    Wu Y.-F.
    Wang L.
    Wang P.-W.
    Jiaotong Yunshu Xitong Gongcheng Yu Xinxi/Journal of Transportation Systems Engineering and Information Technology, 2022, 22 (05): : 107 - 116
  • [3] A Two-Stage Pillar Feature-Encoding Network for Pillar-Based 3D Object Detection
    Xu, Hao
    Dong, Xiang
    Wu, Wenxuan
    Yu, Biao
    Zhu, Hui
    WORLD ELECTRIC VEHICLE JOURNAL, 2023, 14 (06):
  • [4] Distance-dependent Feature Alignment and Selection for Imbalance 3D Point Cloud Object Detection
    Chang, Ming-Jen
    Cheng, Chih-Jen
    Hsiao, Ching-Chun
    Chou, I-Fan
    Huang, Ching-Chun
    2022 18TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2022), 2022,
  • [5] A Novel Feature Point Detection Algorithm of Unstructured 3D Point Cloud
    Tian, Bei
    Jiang, Peilin
    Zhang, Xuetao
    Zhang, Yulong
    Wang, Fei
    INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2016, PT III, 2016, 9773 : 736 - 744
  • [6] 3D point cloud object detection algorithm based on Transformer
    Liu M.
    Yang Q.
    Hu G.
    Guo Y.
    Zhang J.
    Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2023, 41 (06): : 1190 - 1197
  • [7] 3D OBJECT DETECTION NETWORK COMBINED WITH POINT CLOUD COMPLETION
    Zhou, Jing
    Yu, Chao
    Zhang, Junchi
    Hu, Yiyu
    JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2024, 25 (05) : 789 - 809
  • [8] Adaptive learning point cloud and image diversity feature fusion network for 3D object detection
    Weiqing Yan
    Shile Liu
    Hao Liu
    Guanghui Yue
    Xuan Wang
    Yongchao Song
    Jindong Xu
    Complex & Intelligent Systems, 2024, 10 : 2825 - 2837
  • [9] Adaptive learning point cloud and image diversity feature fusion network for 3D object detection
    Yan, Weiqing
    Liu, Shile
    Liu, Hao
    Yue, Guanghui
    Wang, Xuan
    Song, Yongchao
    Xu, Jindong
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (02) : 2825 - 2837
  • [10] 3D Object Detection Based on Feature Fusion of Point Cloud Sequences
    Zhai, Zhenyu
    Wang, Qiantong
    Pan, Zongxu
    Hu, Wenlong
    Hu, Yuxin
    2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 1240 - 1245