Exploration and Generation of Efficient FPGA-based Deep Neural Network Accelerators

被引:4
|
作者
Ali, Nermine [1 ]
Philippe, Jean-Marc [1 ]
Tain, Benoit [1 ]
Coussy, Philippe [2 ]
机构
[1] Univ Paris Saclay, CEA, List, F-91120 Palaiseau, France
[2] Univ South Brittany, Lorient, France
来源
2021 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2021) | 2021年
关键词
Convolutional Neural Networks; Design Space Exploration; High Level Synthesis; Embedded Systems; FPGA;
D O I
10.1109/SiPS52927.2021.00030
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional Neural Networks (CNNs) have emerged as an answer to next-generation applications such as complex image recognition and object detection. Embedding such compute-intensive and memory-hungry algorithms on edge systems will lead to smarter high-value applications. However, the algorithmic innovations in the CNN field leave the hardware accelerators one step behind. Reconfigurable hardware (e.g. FPGAs) allows designing custom accelerators adapted to new algorithms. Furthermore, new design approaches such as high-level synthesis (HLS) enable to generate RTL code based on high-level function descriptions. This paper presents a high-level CNN accelerator generation framework for FPGAs. A first phase of the framework characterizes CNN descriptions using hardware-aware metrics. These metrics then drive a hardware generation phase which builds the proper C source code implementation for each layer of the network. Finally, an HLS tool outputs the synthesizable RTL code of the accelerator. This approach aims at reducing the gap between the evolving applications based on artificial intelligence and hardware accelerators, thus reducing time-to-market of new systems.
引用
收藏
页码:123 / 128
页数:6
相关论文
共 50 条
  • [1] Generating Efficient FPGA-based CNN Accelerators from High-Level Descriptions
    Nermine Ali
    Jean-Marc Philippe
    Benoit Tain
    Philippe Coussy
    Journal of Signal Processing Systems, 2022, 94 : 945 - 960
  • [2] Generating Efficient FPGA-based CNN Accelerators from High-Level Descriptions
    Ali, Nermine
    Philippe, Jean-Marc
    Tain, Benoit
    Coussy, Philippe
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2022, 94 (10): : 945 - 960
  • [3] Design Space Exploration of HW Accelerators and Network Infrastructure for FPGA-Based MPSoC
    Dammak, Bouthaina
    Baklouti, Mouna
    Alsekait, Deema
    IEEE ACCESS, 2024, 12 : 15280 - 15289
  • [4] Towards Efficient Design Space Exploration of FPGA-based Accelerators for Streaming HPC Applications
    Koraei, Mostafa
    Jahre, Magnus
    Fatemi, S. Omid
    FPGA'17: PROCEEDINGS OF THE 2017 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS, 2017, : 287 - 287
  • [5] Optimizing Temporal Convolutional Network Inference on FPGA-Based Accelerators
    Carreras, Marco
    Deriu, Gianfranco
    Raffo, Luigi
    Benini, Luca
    Meloni, Paolo
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2020, 10 (03) : 348 - 361
  • [6] Using Data Compression for Optimizing FPGA-Based Convolutional Neural Network Accelerators
    Guan, Yijin
    Xu, Ningyi
    Zhang, Chen
    Yuan, Zhihang
    Cong, Jason
    ADVANCED PARALLEL PROCESSING TECHNOLOGIES, 2017, 10561 : 14 - 26
  • [7] The Progress and Trends of FPGA-Based Accelerators in Deep Learning
    Wu Y.-X.
    Liang K.
    Liu Y.
    Cui H.-M.
    Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (11): : 2461 - 2480
  • [8] A survey of FPGA-based accelerators for convolutional neural networks
    Sparsh Mittal
    Neural Computing and Applications, 2020, 32 : 1109 - 1139
  • [9] A Survey and Taxonomy of FPGA-based Deep Learning Accelerators
    Blaiech, Ahmed Ghazi
    Ben Khalifa, Khaled
    Valderrama, Carlos
    Fernandes, Marcelo A. C.
    Bedoui, Mohamed Hedi
    JOURNAL OF SYSTEMS ARCHITECTURE, 2019, 98 : 331 - 345
  • [10] Throughput optimizations for FPGA-based deep neural network inference
    Posewsky, Thorbjoern
    Ziener, Daniel
    MICROPROCESSORS AND MICROSYSTEMS, 2018, 60 : 151 - 161