Exploration and Generation of Efficient FPGA-based Deep Neural Network Accelerators

被引：4

作者：

Ali, Nermine ^{[1
]}

Philippe, Jean-Marc ^{[1
]}

Tain, Benoit ^{[1
]}

Coussy, Philippe ^{[2
]}

机构：

[1] Univ Paris Saclay, CEA, List, F-91120 Palaiseau, France

[2] Univ South Brittany, Lorient, France

来源：

2021 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2021) | 2021年

关键词：

Convolutional Neural Networks; Design Space Exploration; High Level Synthesis; Embedded Systems; FPGA;

D O I：

10.1109/SiPS52927.2021.00030

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Convolutional Neural Networks (CNNs) have emerged as an answer to next-generation applications such as complex image recognition and object detection. Embedding such compute-intensive and memory-hungry algorithms on edge systems will lead to smarter high-value applications. However, the algorithmic innovations in the CNN field leave the hardware accelerators one step behind. Reconfigurable hardware (e.g. FPGAs) allows designing custom accelerators adapted to new algorithms. Furthermore, new design approaches such as high-level synthesis (HLS) enable to generate RTL code based on high-level function descriptions. This paper presents a high-level CNN accelerator generation framework for FPGAs. A first phase of the framework characterizes CNN descriptions using hardware-aware metrics. These metrics then drive a hardware generation phase which builds the proper C source code implementation for each layer of the network. Finally, an HLS tool outputs the synthesizable RTL code of the accelerator. This approach aims at reducing the gap between the evolving applications based on artificial intelligence and hardware accelerators, thus reducing time-to-market of new systems.

引用

页码：123 / 128

页数：6

共 50 条

[21] FPGA-based Deep Learning Inference Accelerators: Where Are We Standing?
Nechi, Anouar
Groth, Lukas
Mulhem, Saleh
Merchant, Farhad
Buchty, Rainer
Berekovic, Mladen
ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2023, 16 (04)
[22] FPGA-Based Accelerators of Deep Learning Networks for Learning and Classification: A Review
Shawahna, Ahmad
Sait, Sadiq M.
El-Maleh, Aiman
IEEE ACCESS, 2019, 7 : 7823 - 7859
[23] FPGA-based Acceleration of Neural Network Training
Sang, Ruoyu
Liu, Qiang
Zhang, Qijun
2016 IEEE MTT-S INTERNATIONAL CONFERENCE ON NUMERICAL ELECTROMAGNETIC AND MULTIPHYSICS MODELING AND OPTIMIZATION (NEMO), 2016,
[24] Fast Design Exploration for Performance, Power and Accuracy Tradeoffs in FPGA-Based Accelerators
Ulusel, Onur
Nepal, Kumud
Bahar, R. Iris
Reda, Sherief
ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2014, 7 (01)
[25] An Efficient FPGA-Based Convolutional Neural Network for Classification: Ad-MobileNet
Bouguezzi, Safa
Ben Fredj, Hana
Belabed, Tarek
Valderrama, Carlos
Faiedh, Hassene
Souani, Chokri
ELECTRONICS, 2021, 10 (18)
[26] FPGA-based Accelerator for Deep Convolutional Neural Networks for the SPARK Environment
Morcel, Raghid
Ezzeddine, Mazen
Akkary, Haitham
2016 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2016, : 126 - 133
[27] An Efficient FPGA-Based Architecture for Convolutional Neural Networks
Hwang, Wen-Jyi
Jhang, Yun-Jie
Tai, Tsung-Ming
2017 40TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2017, : 582 - 588
[28] Implementation of FPGA-based Accelerator for Deep Neural Networks
Tsai, Tsung-Han
Ho, Yuan-Chen
Sheu, Ming-Hwa
2019 IEEE 22ND INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS & SYSTEMS (DDECS), 2019,
[29] ExPAN(N)D: Exploring Posits for Efficient Artificial Neural Network Design in FPGA-Based Systems
Nambi, Suresh
Ullah, Salim
Sahoo, Siva Satyendra
Lohana, Aditya
Merchant, Farhad
Kumar, Akash
IEEE ACCESS, 2021, 9 : 103691 - 103708
[30] FPGA-Based High-Performance Data Compression Deep Neural Network Accelerator
Wang, Hanze
Fu, Yingxun
Ma, Li
2022 INTERNATIONAL CONFERENCE ON BIG DATA, INFORMATION AND COMPUTER NETWORK (BDICN 2022), 2022, : 563 - 569

← 1 2 3 4 5 →