OVERVIEW OF A FPGA-BASED OVERLAY PROCESSOR

被引：0

作者：

Yu, Yunxuan ^{[1
,2
]}

Wu, Chen ^{[1
,2
]}

Shi, Xiao ^{[2
]}

He, Lei ^{[1
,2
]}

机构：

[1] Fudan Univ, State Key Lab ASIC & Syst, Shanghai, Peoples R China

[2] Univ Calif Los Angeles, Elect & Comp Engn Dept, Los Angeles, CA 90095 USA

来源：

2019 CHINA SEMICONDUCTOR TECHNOLOGY INTERNATIONAL CONFERENCE (CSTIC) | 2019年

关键词：

D O I：

10.1109/cstic.2019.8755623

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper presents the overview of an overlay architecture on FPGA (OPU). It is applicable to general CNN acceleration with software like programmablility and fast compilation time. Good portability is guaranteed as only executable codes are reloaded for applicationswitch without FPGA re-synthesis. Our OPU instructions have complicated functions with variable runtimes but have a uniform length of 32 bits. This large granularity of instructions makes it easier to develop compiler and micro-architecture. Thus, OPU is a good candidate for domain specific processor without large volume. Experiments results show that OPU can achieve around 90 percent of runtime computing resource utilization (PE efficiency) among eight different network applications, which is 2% - 48% better compared with existing customized accelerators.

引用

页数：3

共 8 条

[1]

[Anonymous], 49 ANN IEEE ACM INT

[2]

[Anonymous], 54 ACM EDAC IEEE DES

[3]

[Anonymous], FPGA 17 P 2017 ACM S

[4]

[Anonymous], FPGA 16 P 2016 ACM S

[5]

Ma Y., 2017, 2017 27 INT C, P1

[6] Exploring Heterogeneous Algorithms for Accelerating Deep Convolutional Neural Networks on FPGAs [J].

Xiao, Qincheng ;

Liang, Yun ;

Lu, Liqiang ;

Yan, Shengen ;

Tai, Yu-Wing .

PROCEEDINGS OF THE 2017 54TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2017,

[7] Automated Systolic Array Architecture Synthesis for High Throughput CNN Inference on FPGAs [J].

Wei, Xuechao ;

Yu, Cody Hao ;

Zhang, Peng ;

Chen, Youxiang ;

Wang, Yuxin ;

Hu, Han ;

Liang, Yun ;

Cong, Jason .

PROCEEDINGS OF THE 2017 54TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2017,

[8] Caffeine: Towards Uniformed Representation and Acceleration for Deep Convolutional Neural Networks [J].

Zhan, Chen ;

Fang, Zhenman ;

Zhou, Peipei ;

Pan, Peichen ;

Cong, Jason .

2016 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2016,

← 1 →