TinyPillarNet: Tiny Pillar-Based Network for 3D Point Cloud Object Detection at Edge

被引：4

作者：

Li, Yishi ^{[1
,2
]}

Zhang, Yuhao ^{[1
,2
]}

Lai, Rui ^{[1
,2
]}

机构：

[1] Xidian Univ, Sch Microelect, Xian 710071, Peoples R China

[2] Xidian Univ, Chongqing Innovat Res Inst Integrated Circuits, Chongqing 400031, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 03期

关键词：

3D object detection; point cloud; tiny machine learning (TinyML); FPGA;

D O I：

10.1109/TCSVT.2023.3297620

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Limited by huge computational cost, high inference latency and large memory consumption, existing 3D point cloud object detection methods are hard to be deployed on Internet of Things (IoT) edge devices. To handle this challenge, we present an extremely tiny framework termed TinyPillarNet. This framework leverages innovative pillar encoder to represent point cloud as immensely tiny pseudo-maps for extremely shrinking the input 3D sensing data. Moreover, a compact dual-stream feature extraction network is put forward to respectively extract intrinsic feature and distributional saliency map, which jointly boosts the detection precision with the lowest hardware cost. Extended experiments on KITTI benchmark demonstrated that our TinyPillarNet yields applicable precision with a record tiny weight size of 1.69 MB at a high inference speed of 1.67 times faster than the current record. Furthermore, the specially designed prototype verification system achieves a superior energy efficiency, which outperforms the similar deep learning based point cloud processing solutions on FPGA with a big margin.

引用

页码：1772 / 1785

页数：14

共 71 条

[1] A Survey on 3D Object Detection Methods for Autonomous Driving Applications
Arnold, Eduardo
Al-Jarrah, Omar Y.
Dianati, Mehrdad
Fallah, Saber
Oxtoby, David
Mouzakitis, Alex
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (10) : 3782 - 3795
[2] Beltrán J, 2018, IEEE INT C INTELL TR, P3517, DOI 10.1109/ITSC.2018.8569311
[3] Chen XZ, 2015, ADV NEUR IN, V28
[4] Choi J, 2018, Arxiv, DOI arXiv:1805.06085
[5] From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection
Deng, Jiajun
Zhou, Wengang
Zhang, Yanyong
Li, Houqiang
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (12) : 4722 - 4734
[6] TinyML Meets IoT: A Comprehensive Survey
Dutta, Lachit
Bharali, Swapna
[J]. INTERNET OF THINGS, 2021, 16
[7] Layer-Specific Optimization for Mixed Data Flow With Mixed Precision in FPGA Design for CNN-Based Object Detectors
Duy Thanh Nguyen
Kim, Hyun
Lee, Hyuk-Jae
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (06) : 2450 - 2464
[8] Howard AG, 2017, Arxiv, DOI [arXiv:1704.04861, DOI 10.48550/ARXIV.1704.04861]
[9] Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
[10] 3D Semantic Segmentation with Submanifold Sparse Convolutional Networks
Graham, Benjamin
Engelcke, Martin
van der Maaten, Laurens
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 9224 - 9232

← 1 2 3 4 5 6 7 8 →