Exploration and Generation of Efficient FPGA-based Deep Neural Network Accelerators

被引:4
|
作者
Ali, Nermine [1 ]
Philippe, Jean-Marc [1 ]
Tain, Benoit [1 ]
Coussy, Philippe [2 ]
机构
[1] Univ Paris Saclay, CEA, List, F-91120 Palaiseau, France
[2] Univ South Brittany, Lorient, France
来源
2021 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2021) | 2021年
关键词
Convolutional Neural Networks; Design Space Exploration; High Level Synthesis; Embedded Systems; FPGA;
D O I
10.1109/SiPS52927.2021.00030
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional Neural Networks (CNNs) have emerged as an answer to next-generation applications such as complex image recognition and object detection. Embedding such compute-intensive and memory-hungry algorithms on edge systems will lead to smarter high-value applications. However, the algorithmic innovations in the CNN field leave the hardware accelerators one step behind. Reconfigurable hardware (e.g. FPGAs) allows designing custom accelerators adapted to new algorithms. Furthermore, new design approaches such as high-level synthesis (HLS) enable to generate RTL code based on high-level function descriptions. This paper presents a high-level CNN accelerator generation framework for FPGAs. A first phase of the framework characterizes CNN descriptions using hardware-aware metrics. These metrics then drive a hardware generation phase which builds the proper C source code implementation for each layer of the network. Finally, an HLS tool outputs the synthesizable RTL code of the accelerator. This approach aims at reducing the gap between the evolving applications based on artificial intelligence and hardware accelerators, thus reducing time-to-market of new systems.
引用
收藏
页码:123 / 128
页数:6
相关论文
共 50 条
  • [21] FPGA-based Deep Learning Inference Accelerators: Where Are We Standing?
    Nechi, Anouar
    Groth, Lukas
    Mulhem, Saleh
    Merchant, Farhad
    Buchty, Rainer
    Berekovic, Mladen
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2023, 16 (04)
  • [22] FPGA-Based Accelerators of Deep Learning Networks for Learning and Classification: A Review
    Shawahna, Ahmad
    Sait, Sadiq M.
    El-Maleh, Aiman
    IEEE ACCESS, 2019, 7 : 7823 - 7859
  • [23] FPGA-based Acceleration of Neural Network Training
    Sang, Ruoyu
    Liu, Qiang
    Zhang, Qijun
    2016 IEEE MTT-S INTERNATIONAL CONFERENCE ON NUMERICAL ELECTROMAGNETIC AND MULTIPHYSICS MODELING AND OPTIMIZATION (NEMO), 2016,
  • [24] Fast Design Exploration for Performance, Power and Accuracy Tradeoffs in FPGA-Based Accelerators
    Ulusel, Onur
    Nepal, Kumud
    Bahar, R. Iris
    Reda, Sherief
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2014, 7 (01)
  • [25] An Efficient FPGA-Based Convolutional Neural Network for Classification: Ad-MobileNet
    Bouguezzi, Safa
    Ben Fredj, Hana
    Belabed, Tarek
    Valderrama, Carlos
    Faiedh, Hassene
    Souani, Chokri
    ELECTRONICS, 2021, 10 (18)
  • [26] FPGA-based Accelerator for Deep Convolutional Neural Networks for the SPARK Environment
    Morcel, Raghid
    Ezzeddine, Mazen
    Akkary, Haitham
    2016 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2016, : 126 - 133
  • [27] An Efficient FPGA-Based Architecture for Convolutional Neural Networks
    Hwang, Wen-Jyi
    Jhang, Yun-Jie
    Tai, Tsung-Ming
    2017 40TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2017, : 582 - 588
  • [28] Implementation of FPGA-based Accelerator for Deep Neural Networks
    Tsai, Tsung-Han
    Ho, Yuan-Chen
    Sheu, Ming-Hwa
    2019 IEEE 22ND INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS & SYSTEMS (DDECS), 2019,
  • [29] ExPAN(N)D: Exploring Posits for Efficient Artificial Neural Network Design in FPGA-Based Systems
    Nambi, Suresh
    Ullah, Salim
    Sahoo, Siva Satyendra
    Lohana, Aditya
    Merchant, Farhad
    Kumar, Akash
    IEEE ACCESS, 2021, 9 : 103691 - 103708
  • [30] FPGA-Based High-Performance Data Compression Deep Neural Network Accelerator
    Wang, Hanze
    Fu, Yingxun
    Ma, Li
    2022 INTERNATIONAL CONFERENCE ON BIG DATA, INFORMATION AND COMPUTER NETWORK (BDICN 2022), 2022, : 563 - 569