Point Cloud Acceleration by Exploiting Geometric Similarity

被引:4
作者
Chen, Cen [1 ,2 ]
Zou, Xiaofeng [1 ]
Shao, Hongen [1 ]
Li, Yangfan [3 ]
Li, Kenli [4 ]
机构
[1] South China Univ Technol, Sch Future Technol, Guangzhou, Peoples R China
[2] Pazhou Lab, Guangzhou, Peoples R China
[3] Cent South Univ, Sch Comp Sci & Engn, Changsha, Peoples R China
[4] Hunan Univ, Coll Informat Sci & Engn, Changsha, Peoples R China
来源
56TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, MICRO 2023 | 2023年
基金
中国国家自然科学基金;
关键词
Hardware Accelerator; Point cloud; Redundancy-Aware Computation; Software-Hardware Co-Design; NEURAL-NETWORK;
D O I
10.1145/3613424.3614290
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning on point clouds has attracted increasing attention for various emerging 3D computer vision applications, such as autonomous driving, robotics, and virtual reality. These applications interact with people in real-time on edge devices and thus require low latency and low energy. To accelerate the execution of deep neural networks (DNNs) on point clouds, some customized accelerators have been proposed, which achieved a significantly higher performance with reduced energy consumption than GPUs and existing DNN accelerators. In this work, we reveal that DNNs execution on geometrically adjacent points exhibits similar values and relations, and exhibits a large amount of redundant computation and communication due to the correlations. To address this issue, we propose GDPCA, a geometry-aware differential point cloud accelerator, which can exploit geometric similarity to reduce these redundancies for point cloud neural networks. GDPCA is supported by an algorithm and architecture co-design. Our proposed algorithm can discover and reduce computation and communication redundancies with geometry-aware and differential execution mechanisms. Then a novel architecture is designed to support the proposed algorithm and transform the redundancy reduction into performance improvement. GDPCA performs the same computations and gives the same accuracy as traditional point cloud neural networks. To the best of our knowledge, GDPCA is the first accelerator that can reduce execution redundancies for point cloud neural networks by exploiting geometric similarity. Our proposed GDPCA system gains an average of 2.9x speedup and 2.7x energy efficiency over state-of-the-art accelerators for point cloud neural networks.
引用
收藏
页码:1135 / 1147
页数:13
相关论文
共 51 条
[1]   Bit-Pragmatic Deep Neural Network Computing [J].
Albericio, Jorge ;
Delmas, Alberto ;
Judd, Patrick ;
Sharify, Sayeh ;
O'Leary, Gerard ;
Genov, Roman ;
Moshovos, Andreas .
50TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2017, :382-394
[2]  
Ali W, 2018, P EUR C COMP VIS ECC
[3]   ReGNN: A Redundancy-Eliminated Graph Neural Networks Accelerator [J].
Chen, Cen ;
Li, Kenli ;
Li, Yangfan ;
Zou, Xiaofeng .
2022 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2022), 2022, :429-443
[4]   DyGNN: Algorithm and Architecture Support of Dynamic Pruning for Graph Neural Networks [J].
Chen, Cen ;
Li, Kenli ;
Zou, Xiaofeng ;
Li, Yangfan .
2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, :1201-1206
[5]   3D Point Cloud Processing and Learning for Autonomous Driving: Impacting Map Creation, Localization, and Perception [J].
Chen, Siheng ;
Liu, Baoan ;
Feng, Chen ;
Vallespi-Gonzalez, Carlos ;
Wellington, Carl .
IEEE SIGNAL PROCESSING MAGAZINE, 2021, 38 (01) :68-86
[6]   Deep Learning for Image and Point Cloud Fusion in Autonomous Driving: A Review [J].
Cui, Yaodong ;
Chen, Ren ;
Chu, Wenbo ;
Chen, Long ;
Tian, Daxin ;
Li, Ying ;
Cao, Dongpu .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (02) :722-739
[7]   Crescent: Taming Memory Irregularities for Accelerating Deep Point Cloud Analytics [J].
Feng, Yu ;
Hammonds, Gunnar ;
Gan, Yiming ;
Zhu, Yuhao .
PROCEEDINGS OF THE 2022 THE 49TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA '22), 2022, :962-977
[8]   Mesorasi: Architecture Support for Point Cloud Analytics via Delayed-Aggregation [J].
Feng, Yu ;
Tian, Boyuan ;
Xu, Tiancheng ;
Whatmough, Paul ;
Zhu, Yuhao .
2020 53RD ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO 2020), 2020, :1037-1050
[9]   Spartus: A 9.4 TOp/s FPGA-Based LSTM Accelerator Exploiting Spatio-Temporal Sparsity [J].
Gao, Chang ;
Delbruck, Tobi ;
Liu, Shih-Chii .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) :1098-1112
[10]   EdgeDRNN: Recurrent Neural Network Accelerator for Edge Inference [J].
Gao, Chang ;
Rios-Navarro, Antonio ;
Chen, Xi ;
Liu, Shih-Chii ;
Delbruck, Tobi .
IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2020, 10 (04) :419-432