Prior Distribution and Perception Radius Are Important: Learning 3-D Object Detector for Autonomous Driving

被引：0

作者：

Tao, Ziming ^{[1
]}

Mao, Jianxu ^{[1
]}

Peng, Weixing ^{[2
]}

Wang, Yaonan ^{[1
]}

Yi, Junfei ^{[1
]}

Zhang, Hui ^{[2
]}

机构：

[1] Hunan Univ, Coll Elect & Informat Engn, Natl Engn Lab Robot Visual Percept & Control Techn, Changsha 410082, Hunan, Peoples R China

[2] Hunan Univ, Coll Robot, Natl Engn Res Ctr Robot Visual Percept & Control T, Changsha 410082, Hunan, Peoples R China

来源：

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT | 2025年 / 74卷

基金：

中国国家自然科学基金;

关键词：

Three-dimensional displays; Detectors; Point cloud compression; Feature extraction; Semantics; Object detection; Laser radar; Autonomous vehicles; Proposals; Automobiles; 3-D detection; artificial intelligence; autonomous driving; computer vision; intelligent vehicles; LiDAR measurement;

D O I：

10.1109/TIM.2025.3566807

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This article proposes an efficient 3-D object detection algorithm for LiDAR point clouds in the field of autonomous driving. Currently, despite significant performance improvements in 3-D LiDAR object detection methods, there are still problems that have impact on 3-D detection accuracy. Typically, since the sampling process and grouping process introduced background points, the final regression step in the detection task would be greatly affected. To address this problem, a method named PR-SSD is proposed to retain more important foreground points during the training pipeline, improving detection accuracy. Specifically, the proposed approach utilizes prior information to guide semantic sampling, referred to as prior distributed semantic sampling (PDSS). This mechanism encourages the network to prioritize foreground targets in regression. In addition, a module named radius-aware attention multiscale grouping (RAMSG) is designed to dynamically reassign grouping weights for multiscale features. In other words, the network has adaptive detection capabilities for targets of different scales. Furthermore, PR-SSD is a single-stage detector which can be trained end-to-end. Finally, extensive experiments and evaluations on large-scale benchmarks KITTI demonstrate that our proposed method significantly enhanced 3-D object detector. In KITTI testing set, PR-SSD achieves 89.69%, 45.08%, and 80.01% detection performance for cars, pedestrians, and cyclists.

引用

页数：13

共 41 条

[1] ScorePillar: A Real-Time Small Object Detection Method Based on Pillar Scoring of Lidar Measurement [J].

Cao, Zonghan ;

Wang, Ting ;

Sun, Ping ;

Cao, Fengkui ;

Shao, Shiliang ;

Wang, Shaocong .

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 :1-13

[2]

Chen C, 2022, AAAI CONF ARTIF INTE, P221

[3] Multi-View 3D Object Detection Network for Autonomous Driving [J].

Chen, Xiaozhi ;

Ma, Huimin ;

Wan, Ji ;

Li, Bo ;

Xia, Tian .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6526-6534

[4]

Dosovitskiy A, 2017, PR MACH LEARN RES, V78

[5]

Fan L, 2022, ADV NEUR IN

[6] Embracing Single Stride 3D Object Detector with Sparse Transformer [J].

Fan, Lue ;

Pang, Ziqi ;

Zhang, Tianyuan ;

Wang, Yu-Xiong ;

Zhao, Hang ;

Wang, Feng ;

Wang, Naiyan ;

Zhang, Zhaoxiang .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :8448-8458

[7]

Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074

[8]

Hayeon O., 2024, P AS C COMP VIS ACCV, P2889

[9]

Ku J, 2018, IEEE INT C INT ROBOT, P5750, DOI 10.1109/IROS.2018.8594049

[10] PointPillars: Fast Encoders for Object Detection from Point Clouds [J].

Lang, Alex H. ;

Vora, Sourabh ;

Caesar, Holger ;

Zhou, Lubing ;

Yang, Jiong ;

Beijbom, Oscar .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :12689-12697

← 1 2 3 4 5 →