An Efficient Point Cloud Correlation Enhancement RCNN for 3D Object Detection

被引：0

作者：

Du, Jialong ^{[1
]}

Huang, Hanzhang ^{[2
]}

Tan, Qingji ^{[3
]}

Li, Yong ^{[1
,4
]}

Ding, Lu ^{[1
]}

Shuang, Feng ^{[1
]}

机构：

[1] Guangxi Univ, Sch Elect Engn, Guangxi Key Lab Intelligent Control & Maintenance, Nanning 530004, Peoples R China

[2] China Tobacco Guangxi Ind Co Ltd, Nanning 530001, Peoples R China

[3] Guangxi Univ, Sch Mech Engn, Nanning 530004, Peoples R China

[4] Minist Educ, Key Lab Adv Mfg Technol, Guiyang 550025, Guizhou, Peoples R China

来源：

INFORMATION TECHNOLOGY AND CONTROL | 2025年 / 54卷 / 01期

关键词：

3-D Object Detection; Lightweight Proposal; Self-Attention; Point Cloud; Autonomous Driving;

D O I：

10.5755/j01.itc.54.1.35616

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To meet the requirement of 3D object detection task, an efficient point cloud correlation enhancement RCNN(EPCE-RCNN) is proposed. The proposed method reduces the computational complexity and time consumption of the network through a lightweight proposal generation module, and accelerates the generation of the 3D proposal box. Meanwhile, during region of interest feature coding, the relevance among different grid points is enhanced through an efficient self-attention pooling module, so that the limitation that the pooling operation is influenced by the radius of a neighborhood query sphere is addressed. In addition, the combination of an attention mechanism and a feedforward network ensures the nonlinearity of the model, so that the model can perform feature expression better. Thus, the synchronous improvement of the network detection efficiency and the detection precision is realized. On the KITTI dataset, the detection accuracy of three difficulty levels reaches 89.99%, 81.69% and 77.17% respectively. Compared with the baseline Voxel-RCNN, the detection efficiency of EPCE-RCNN is improved by 12%. To verify the generalization and application value of the proposed method, a power equipment dataset with 3D label information is constructed, the 3D label frame information of the YCB dataset is also supplemented. Experiments are carried out on these datasets. In the experimental results of the validation set, the mAP of a mug, gelatin box, single clip, wedge clip and C clip can reach 37.67%, 40.06%, 35.63%, 30.01% and 37.31% respectively. Compared with the baseline, the proposed algorithm has a significant improvement and its generalization has been fully verified.

引用

页码：198 / 218

页数：360

共 53 条

[1] Scale-Hierarchical 3D Object Recognition in Cluttered Scenes [J].

Bariya, Prabin ;

Nishino, Ko .

2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :1657-1664

[2]

Chen C, 2022, AAAI CONF ARTIF INTE, P221

[3] Point signatures: A new representation for 3D object recognition [J].

Chua, CS ;

Jarvis, R .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 1997, 25 (01) :63-85

[4]

Deng JJ, 2021, AAAI CONF ARTIF INTE, V35, P1201

[5] Benchmarking Robustness of 3D Object Detection to Common Corruptions in Autonomous Driving [J].

Dong, Yinpeng ;

Kang, Caixin ;

Zhang, Jinlai ;

Zhu, Zijian ;

Wang, Yikai ;

Yang, Xiao ;

Su, Hang ;

Wei, Xingxing ;

Zhu, Jun .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :1022-1032

[6] Associate-3Ddet: Perceptual-to-Conceptual Association for 3D Point Cloud Object Detection [J].

Du, Liang ;

Ye, Xiaoqing ;

Tan, Xiao ;

Feng, Jianfeng ;

Xu, Zhenbo ;

Ding, Errui ;

Wen, Shilei .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :13326-13335

[7] 3D Object Detection with a Self-supervised Lidar Scene Flow Backbone [J].

Ercelik, Emec ;

Yurtsever, Ekim ;

Liu, Mingyu ;

Yang, Zhijie ;

Zhang, Hanzhen ;

Topcam, Pinar ;

Listl, Maximilian ;

Cayli, Yilmaz Kaan ;

Knoll, Alois .

COMPUTER VISION, ECCV 2022, PT X, 2022, 13670 :247-265

[8]

Howard AG, 2017, Arxiv, DOI [arXiv:1704.04861, 10.48550/arXiv.1704.04861]

[9] Vision meets robotics: The KITTI dataset [J].

Geiger, A. ;

Lenz, P. ;

Stiller, C. ;

Urtasun, R. .

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) :1231-1237

[10] GhostNet: More Features from Cheap Operations [J].

Han, Kai ;

Wang, Yunhe ;

Tian, Qi ;

Guo, Jianyuan ;

Xu, Chunjing ;

Xu, Chang .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :1577-1586

← 1 2 3 4 5 6 →