Power-Driven DNN Dataflow Optimization on FPGA

被引:9
作者
Sun, Qi [1 ]
Chen, Tinghuan [1 ]
Miao, Jin [2 ]
Yu, Bei [1 ]
机构
[1] Chinese Univ Hong Kong, CSE Dept, Hong Kong, Peoples R China
[2] Cadence Design Syst, San Jose, CA USA
来源
2019 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD) | 2019年
关键词
FPGA; Deep Neural Network; Dataflow Optimization; Power;
D O I
10.1109/iccad45719.2019.8942085
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Deep neural networks (DNNs) have been proven to achieve unprecedented success on modern artificial intelligence (AI) tasks, which have also greatly motivated the rapid developments of novel DNN models and hardware accelerators. Many challenges still remain towards the design of power efficient DNN accelerator due to the intrinsically intensive data computation and transmission in DNN algorithms. However, most existing efforts in the domain have taken latency as the sole optimization objective, which may often result in sub-optimality in power consumption. In this paper, we propose a framework to optimize the power efficiency of DNN dataflow on FPGA while maximally minimizing the impact on latency. We first propose power and latency models that are built upon different dataflow configurations. Then a power-driven dataflow formulation is proposed, which enables a hierarchical exploration strategy on the dataflow configurations, leading to efficient power consumption at limited latency loss. Experimental results have demonstrated the effectiveness of our proposed models and exploration strategies, where power improvement has shown up to 31% with latency degradation of no worse than 6.5%
引用
收藏
页数:7
相关论文
共 34 条
[1]  
Alwani M., 2016, P 49 ANN IEEE ACM IN, P1, DOI DOI 10.1109/MICRO.2016.7783725
[2]  
[Anonymous], 2019, 56 DAC ACM
[3]  
[Anonymous], IEEE T COMPUTER AIDE
[4]  
[Anonymous], 2018, INT C MACH LEARN
[5]  
[Anonymous], 2015, PROC CVPR IEEE
[6]  
[Anonymous], ADV NEURAL INFORM PR
[7]  
[Anonymous], IEEE T COMPUTER AIDE
[8]  
Chen DM, 2003, ISLPED'03: PROCEEDINGS OF THE 2003 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, P134
[9]   AtICNet: semantic segmentation with atrous spatial pyramid pooling in image cascade network [J].
Chen, Jin ;
Wang, Chuanya ;
Tong, Ying .
EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2019, 2019 (1)
[10]  
Chen TQ, 2018, PROCEEDINGS OF THE 13TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P579