Exploration and Generation of Efficient FPGA-based Deep Neural Network Accelerators

被引:4
作者
Ali, Nermine [1 ]
Philippe, Jean-Marc [1 ]
Tain, Benoit [1 ]
Coussy, Philippe [2 ]
机构
[1] Univ Paris Saclay, CEA, List, F-91120 Palaiseau, France
[2] Univ South Brittany, Lorient, France
来源
2021 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2021) | 2021年
关键词
Convolutional Neural Networks; Design Space Exploration; High Level Synthesis; Embedded Systems; FPGA;
D O I
10.1109/SiPS52927.2021.00030
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional Neural Networks (CNNs) have emerged as an answer to next-generation applications such as complex image recognition and object detection. Embedding such compute-intensive and memory-hungry algorithms on edge systems will lead to smarter high-value applications. However, the algorithmic innovations in the CNN field leave the hardware accelerators one step behind. Reconfigurable hardware (e.g. FPGAs) allows designing custom accelerators adapted to new algorithms. Furthermore, new design approaches such as high-level synthesis (HLS) enable to generate RTL code based on high-level function descriptions. This paper presents a high-level CNN accelerator generation framework for FPGAs. A first phase of the framework characterizes CNN descriptions using hardware-aware metrics. These metrics then drive a hardware generation phase which builds the proper C source code implementation for each layer of the network. Finally, an HLS tool outputs the synthesizable RTL code of the accelerator. This approach aims at reducing the gap between the evolving applications based on artificial intelligence and hardware accelerators, thus reducing time-to-market of new systems.
引用
收藏
页码:123 / 128
页数:6
相关论文
共 18 条
[1]  
Ali N., 2020, PROC IEEE INT WORKSH, P1
[2]  
Carbon A, 2018, DES AUT TEST EUROPE, P1039, DOI 10.23919/DATE.2018.8342165
[3]   Eyeriss: An Energy-Efficient Reconfigurable Accelerator for Deep Convolutional Neural Networks [J].
Chen, Yu-Hsin ;
Krishna, Tushar ;
Emer, Joel S. ;
Sze, Vivienne .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2017, 52 (01) :127-138
[4]  
Desoli G, 2017, ISSCC DIG TECH PAP I, P238, DOI 10.1109/ISSCC.2017.7870349
[5]  
Erdem A, 2018, IEEE INT CONF ASAP, P113
[6]   FP-DNN: An Automated Framework for Mapping Deep Neural Networks onto FPGAs with RTL-HLS Hybrid Templates [J].
Guan, Yijin ;
Liang, Hao ;
Xu, Ningyi ;
Wang, Wenqiang ;
Shi, Shaoshuai ;
Chen, Xi ;
Sun, Guangyu ;
Zhang, Wei ;
Cong, Jason .
2017 IEEE 25TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2017), 2017, :152-159
[7]  
Han S., 2016, P INT C LEARN REPR I, P1
[8]   Gradient-based learning applied to document recognition [J].
Lecun, Y ;
Bottou, L ;
Bengio, Y ;
Haffner, P .
PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324
[9]   Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories [J].
Li Fei-Fei ;
Fergus, Rob ;
Perona, Pietro .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2007, 106 (01) :59-70
[10]  
List, 2021, N2D2 NEURAL NETWORK