Mesorasi: Architecture Support for Point Cloud Analytics via Delayed-Aggregation

被引:52
作者
Feng, Yu [1 ]
Tian, Boyuan [1 ]
Xu, Tiancheng [1 ]
Whatmough, Paul [2 ]
Zhu, Yuhao [1 ]
机构
[1] Univ Rochester, Rochester, NY 14627 USA
[2] Arm Res, Austin, TX USA
来源
2020 53RD ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO 2020) | 2020年
关键词
Point cloud; DNN; accelerator;
D O I
10.1109/MICRO50266.2020.00087
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Point cloud analytics is poised to become a key workload on battery-powered embedded and mobile platforms in a wide range of emerging application domains, such as autonomous driving, robotics, and augmented reality, where efficiency is paramount. This paper proposes MESORASI, an algorithm-architecture co-designed system that simultaneously improves the performance and energy efficiency of point cloud analytics while retaining its accuracy. Our extensive characterizations of state-of-the-art point cloud algorithms show that, while structurally reminiscent of convolutional neural networks (CNNs), point cloud algorithms exhibit inherent compute and memory inefficiencies due to the unique characteristics of point cloud data. We propose delayed-aggregation, a new algorithmic primitive for building efficient point cloud algorithms. Delayed-aggregation hides the performance bottlenecks and reduces the compute and memory redundancies by exploiting the approximately distributive property of key operations in point cloud algorithms. Delayed-aggregation let point cloud algorithms achieve 1.6x speedup and 51.1% energy reduction on a mobile GPU while retaining the accuracy (-0.9% loss to 1.2% gains). To maximize the algorithmic benefits, we propose minor extensions to contemporary CNN accelerators, which can be integrated into a mobile Systems-on-a-Chip (SoC) without modifying other SoC components. With additional hardware support, MESORASI achieves up to 3.6x speedup.
引用
收藏
页码:1037 / 1050
页数:14
相关论文
共 67 条
[31]  
Kuhara Takuya., 2013, An FPGA Acceleration for the Kd-tree Search in Photon Mapping
[32]  
Leng Yue., 2019, P 46 INT S COMPUTER
[33]  
Levoy M, 1985, USE POINTS DISPLAY P
[34]  
Liu Y., 2019, P 14 IEEE INT C COMP
[35]  
Liu ZJ, 2019, ADV NEUR IN, V32
[36]  
Mahmoud M., 2017, P 50 ANN IEEE ACM IN
[37]  
Mahmoud Mostafa, 2018, P 51 ANN IEEE ACM IN
[38]  
Mazumdar A, 2017, I S WORKL CHAR PROC, P177, DOI 10.1109/IISWC.2017.8167775
[39]   VIP: Virtualizing IP Chains on Handheld Platforms [J].
Nachiappan, Nachiappan Chidambaram ;
Zhang, Haibo ;
Ryoo, Jihyun ;
Soundararajan, Niranjan ;
Sivasubramaniam, Anand ;
Kandemir, Mahmut T. ;
Iyer, Ravi ;
Das, Chita R. .
2015 ACM/IEEE 42ND ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2015, :655-667
[40]  
Pfister H, 2000, COMP GRAPH, P335, DOI 10.1145/344779.344936