TinyPillarNet: Tiny Pillar-Based Network for 3D Point Cloud Object Detection at Edge

被引:4
作者
Li, Yishi [1 ,2 ]
Zhang, Yuhao [1 ,2 ]
Lai, Rui [1 ,2 ]
机构
[1] Xidian Univ, Sch Microelect, Xian 710071, Peoples R China
[2] Xidian Univ, Chongqing Innovat Res Inst Integrated Circuits, Chongqing 400031, Peoples R China
关键词
3D object detection; point cloud; tiny machine learning (TinyML); FPGA;
D O I
10.1109/TCSVT.2023.3297620
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Limited by huge computational cost, high inference latency and large memory consumption, existing 3D point cloud object detection methods are hard to be deployed on Internet of Things (IoT) edge devices. To handle this challenge, we present an extremely tiny framework termed TinyPillarNet. This framework leverages innovative pillar encoder to represent point cloud as immensely tiny pseudo-maps for extremely shrinking the input 3D sensing data. Moreover, a compact dual-stream feature extraction network is put forward to respectively extract intrinsic feature and distributional saliency map, which jointly boosts the detection precision with the lowest hardware cost. Extended experiments on KITTI benchmark demonstrated that our TinyPillarNet yields applicable precision with a record tiny weight size of 1.69 MB at a high inference speed of 1.67 times faster than the current record. Furthermore, the specially designed prototype verification system achieves a superior energy efficiency, which outperforms the similar deep learning based point cloud processing solutions on FPGA with a big margin.
引用
收藏
页码:1772 / 1785
页数:14
相关论文
共 71 条
  • [1] A Survey on 3D Object Detection Methods for Autonomous Driving Applications
    Arnold, Eduardo
    Al-Jarrah, Omar Y.
    Dianati, Mehrdad
    Fallah, Saber
    Oxtoby, David
    Mouzakitis, Alex
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (10) : 3782 - 3795
  • [2] Beltrán J, 2018, IEEE INT C INTELL TR, P3517, DOI 10.1109/ITSC.2018.8569311
  • [3] Chen XZ, 2015, ADV NEUR IN, V28
  • [4] Choi J, 2018, Arxiv, DOI arXiv:1805.06085
  • [5] From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection
    Deng, Jiajun
    Zhou, Wengang
    Zhang, Yanyong
    Li, Houqiang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (12) : 4722 - 4734
  • [6] TinyML Meets IoT: A Comprehensive Survey
    Dutta, Lachit
    Bharali, Swapna
    [J]. INTERNET OF THINGS, 2021, 16
  • [7] Layer-Specific Optimization for Mixed Data Flow With Mixed Precision in FPGA Design for CNN-Based Object Detectors
    Duy Thanh Nguyen
    Kim, Hyun
    Lee, Hyuk-Jae
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (06) : 2450 - 2464
  • [8] Howard AG, 2017, Arxiv, DOI [arXiv:1704.04861, DOI 10.48550/ARXIV.1704.04861]
  • [9] Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
  • [10] 3D Semantic Segmentation with Submanifold Sparse Convolutional Networks
    Graham, Benjamin
    Engelcke, Martin
    van der Maaten, Laurens
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 9224 - 9232