An Efficient FPGA Accelerator for Point Cloud

被引:0
|
作者
Wang, Zilun [1 ]
Mao, Wendong [1 ]
Yang, Peixiang [1 ]
Wang, Zhongfeng [1 ]
Lin, Jun [1 ]
机构
[1] Nanjing Univ, Sch Elect Sci & Engn, Nanjing, Peoples R China
来源
2022 IEEE 35TH INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (IEEE SOCC 2022) | 2022年
基金
中国国家自然科学基金;
关键词
Point cloud; submanifold sparse convolution; hardware architecture;
D O I
10.1109/SOCC56010.2022.9908099
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning-based point cloud processing plays an important role in various vision tasks, such as autonomous driving, virtual reality (VR), and augmented reality (AR). The submanifold sparse convolutional network (SSCN) has been widely used for the point cloud due to its unique advantages in terms of visual results. However, existing convolutional neural network accelerators suffer from non-trivial performance degradation when employed to accelerate SSCN because of the extreme and unstructured sparsity, and the complex computational dependency between the sparsity of the central activation and the neighborhood ones. In this paper, we propose a high performance FPGA-based accelerator for SSCN. Firstly, we develop a zero removing strategy to remove the coarse-grained redundant regions, thus significantly improving computational efficiency. Secondly, we propose a concise encoding scheme to obtain the matching information for efficient point-wise multiplications. Thirdly, we develop a sparse data matching unit and a computing core based on the proposed encoding scheme, which can convert the irregular sparse operations into regular multiply-accumulate operations. Finally, an efficient hardware architecture for the submanifold sparse convolutional layer is developed and implemented on the Xilinx ZCU102 field-programmable gate array board, where the 3D submanifold sparse U-Net is taken as the benchmark. The experimental results demonstrate that our design drastically improves computational efficiency, and can dramatically improve the power efficiency by 51 times compared to GPU.
引用
收藏
页码:310 / 315
页数:6
相关论文
共 50 条
  • [1] PointAcc: Efficient Point Cloud Accelerator
    Lin, Yujun
    Zhang, Zhekai
    Tang, Haotian
    Wang, Hanrui
    Han, Song
    PROCEEDINGS OF 54TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, MICRO 2021, 2021, : 449 - 461
  • [2] An FPGA-based kNN Seach Accelerator for point cloud registration
    Wang, Chengliang
    Huang, Zhetong
    Ren, Ao
    Zhang, Xun
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [3] SpCNA: An FPGA-based Accelerator for Point Cloud Convolutional Neural Networks
    Zhou, Gong-Lang
    Guo, Kaiyuan
    Chen, Xiang
    Leung, Kwok Wa
    2023 IEEE 31ST ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, FCCM, 2023, : 211 - 211
  • [4] Efficient CNN Accelerator on FPGA
    Kala, S.
    Nalesh, S.
    IETE JOURNAL OF RESEARCH, 2020, 66 (06) : 733 - 740
  • [5] An Efficient Accelerator for Point-based and Voxel-based Point Cloud Neural Networks
    Yang, Xinhao
    Fu, Tianyu
    Dai, Guohao
    Zeng, Shulin
    Zhong, Kai
    Hong, Ke
    Wang, Yu
    2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [6] Point Cloud Rendering in FPGA
    Zemcik, Pavel
    Marsik, Lukas
    Herout, Adam
    WSCG 2009, COMMUNICATION PAPERS PROCEEDINGS, 2009, : 63 - 66
  • [7] AN ACCELERATOR OF EFFICIENT DSP BASED ON FPGA
    Zhang Boyu
    Zou Ha
    Tang Ming
    Lin Qiutong
    2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
  • [8] An Efficient Sparse CNNs Accelerator on FPGA
    Zhang, Yonghua
    Jiang, Hongxu
    Li, Xiaobin
    Wang, Haojie
    Dong, Dong
    Cao, Yongxiang
    2022 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2022), 2022, : 504 - 505
  • [9] Five-point algorithm: an efficient cloud-based FPGA implementation
    Rabozzi, Marco
    Del Sozzo, Emanuele
    Di Tucci, Lorenzo
    Santambrogio, Marco D.
    2018 IEEE 29TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2018, : 9 - 16
  • [10] FLNA: An Energy-Efficient Point Cloud Feature Learning Accelerator with Dataflow Decoupling
    Lyu, Dongxu
    Li, Zhenyu
    Chen, Yuzhou
    Xu, Ningyi
    He, Guanghui
    2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,