Design and Application Space Exploration of a Domain-Specific Accelerator System

被引:4
作者
Feng, Fan [1 ]
Li, Li [1 ]
Wang, Kun [1 ]
Fu, Yuxiang [1 ]
He, Guoqiang [2 ]
Pan, Hongbing [1 ]
机构
[1] Nanjing Univ, Sch Elect Sci & Engn, Nanjing 210023, Jiangsu, Peoples R China
[2] Nanjing Res Inst Elect Technol, Nanjing 210013, Jiangsu, Peoples R China
关键词
dark silicon; accelerator system; high-performance radar processing; shared scratchpad memory; FPGA prototyping; chip testing; application space exploration;
D O I
10.3390/electronics7040045
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Domain-specific accelerators are a reaction adapting to device scaling and the dark silicon era. This paper describes a radar signal processing oriented configurable accelerator and the application space exploration of the system. The system is built around accelerator engines and general-purpose processors (GPPs) that make it suitable for intensive computing kernel acceleration and complex control tasks. It is geared toward high-performance radar digital signal processing; we characterize the applications and find that each of them contains a series of serializable kernels. Taking advantage of this discovery, we design an algorithm pool that shares the same computation resource and memory resource, and each algorithm is size reconfigurable. On the other hand, shared on-chip addressable scratchpad memory eliminates unnecessary explicit data copy between accelerators. Performance of the system is evaluated from measurements performed both on an FPGA SoC test chip and on a prototype chip fabricated by CMOS 40 nm technology. The experimental results show that for different algorithms, the proposed system achieves 1.9 x to 10.1 x performance gain compared with a state-of-the-art TI DSP chip. In order to characterize the application of the system, a complex real-life task is adopted, and the results show that it can obtain high throughput and desirable precision.
引用
收藏
页数:15
相关论文
共 30 条
[1]  
[Anonymous], ACM IEEE INT S MICR
[2]  
[Anonymous], 2014, TMS320C6672 TEX INST
[3]  
[Anonymous], 2013, AN2114
[4]  
[Anonymous], 2014, T4240PB NXP
[5]  
Bin Altaf MS, 2017, 44TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2017), P375, DOI [10.1145/3140659.3080216, 10.1145/3079856.3080216]
[6]   Configurable Floating-Point FFT Accelerator on FPGA Based Multiple-Rotation CORDIC [J].
Chen Jiyang ;
Lei Yuanwu ;
Peng Yuanxi ;
He Tingting ;
Deng Ziye .
CHINESE JOURNAL OF ELECTRONICS, 2016, 25 (06) :1063-1070
[7]   Eyeriss: An Energy-Efficient Reconfigurable Accelerator for Deep Convolutional Neural Networks [J].
Chen, Yu-Hsin ;
Krishna, Tushar ;
Emer, Joel S. ;
Sze, Vivienne .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2017, 52 (01) :127-138
[8]   NEURAL ACCELERATION FOR GENERAL-PURPOSE APPROXIMATE PROGRAMS [J].
Esmaeilzadeh, Hadi ;
Sampson, Adrian ;
Ceze, Luis ;
Burger, Doug .
IEEE MICRO, 2013, 33 (03) :16-27
[9]   DARK SILICON AND THE END OF MULTICORE SCALING [J].
Esmaeilzadeh, Hadi ;
Blem, Emily ;
St Amant, Renee ;
Sankaralingam, Karthikeyan ;
Burger, Doug .
IEEE MICRO, 2012, 32 (03) :122-134
[10]  
FENG F, 2017, P 2017 IEEE 12 INT C, P287