Accelerating DNNs from local to virtualized FPGA in the Cloud: A survey of trends

被引:10
作者
Wu, Chen [1 ]
Fresse, Virginie [1 ]
Suffran, Benoit [2 ]
Konik, Hubert [1 ]
机构
[1] Univ Lyon, Hubert Curien Lab, Univ St Etienne, Lyon, France
[2] ST Microelect, F-38000 Grenoble, France
关键词
FPGA virtualization; Cloud computing; Deep neural network; Accelerator; Trends; NEURAL-NETWORK; DESIGN; CNN; SCALE; FLOW;
D O I
10.1016/j.sysarc.2021.102257
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Field-programmable gate arrays (FPGAs) are widely used locally to speed up deep neural network (DNN) algorithms with high computational throughput and energy efficiency. Virtualizing FPGA and deploying FPGAs in the cloud are becoming increasingly attractive methods for DNN acceleration because they can enhance the computing ability to achieve on-demand acceleration across multiple users. In the past five years, researchers have extensively investigated various directions of FPGA-based DNN accelerators, such as algorithm optimization, architecture exploration, capacity improvement, resource sharing, and cloud construction. However, previous DNN accelerator surveys mainly focused on optimizing the DNN performance on a local FPGA, ignoring the trend of placing DNN accelerators in the cloud's FPGA. In this study, we conducted an in-depth investigation of the technologies used in FPGA-based DNN accelerators, including but not limited to architectural design, optimization strategies, virtualization technologies, and cloud services. Additionally, we studied the evolution of DNN accelerators, e.g., from a single DNN to framework-generated DNNs, from physical to virtualized FPGAs, from local to the cloud, and from single-user to multi-tenant. We also identified significant obstacles for DNN acceleration in the cloud. This article enhances the current understanding of the evolution of FPGA-based DNN accelerators.
引用
收藏
页数:15
相关论文
共 118 条
[91]   A Survey on FPGA Virtualization [J].
Vaishnav, Anuj ;
Khoa Dang Pham ;
Koch, Dirk .
2018 28TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2018, :131-138
[92]   Resource Elastic Virtualization for FPGAs using OpenCL [J].
Vaishnav, Anuj ;
Khoa Dang Pham ;
Koch, Dirk ;
Garside, James .
2018 28TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2018, :111-118
[93]  
Vaswani A, 2017, ADV NEUR IN, V30
[94]  
Venieris S.I., 2021, ARXIV PREPRINT ARXIV
[95]   f-CNNx: A Toolflow for Mapping Multiple Convolutional Neural Networks on FPGAs [J].
Venieris, Stylianos I. ;
Bouganis, Christos-Savvas .
2018 28TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2018, :381-388
[96]   Toolflows for Mapping Convolutional Neural Networks on FPGAs: A Survey and Future Directions [J].
Venieris, Stylianos I. ;
Kouris, Alexandros ;
Bouganis, Christos-Savvas .
ACM COMPUTING SURVEYS, 2018, 51 (03)
[97]   fpgaConvNet: A Framework for Mapping Convolutional Neural Networks on FPGAs [J].
Venieris, Stylianos I. ;
Bouganis, Christos-Savvas .
2016 IEEE 24TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2016, :40-47
[98]   FPGA Dynamic and Partial Reconfiguration: A Survey of Architectures, Methods, and Applications [J].
Vipin, Kizheppatt ;
Fahmy, Suhaib A. .
ACM COMPUTING SURVEYS, 2018, 51 (04)
[99]   LUTNet: Learning FPGA Configurations for Highly Efficient Neural Network Inference [J].
Wang, Erwei ;
Davis, James J. ;
Cheung, Peter Y. K. ;
Constantinides, George A. .
IEEE TRANSACTIONS ON COMPUTERS, 2020, 69 (12) :1795-1808
[100]   Enabling FPGAs in Hyperscale Data Centers [J].
Weerasinghe, Jagath ;
Abel, Francois ;
Hagleitner, Christoph ;
Herkersdorf, Andreas .
IEEE 12TH INT CONF UBIQUITOUS INTELLIGENCE & COMP/IEEE 12TH INT CONF ADV & TRUSTED COMP/IEEE 15TH INT CONF SCALABLE COMP & COMMUN/IEEE INT CONF CLOUD & BIG DATA COMP/IEEE INT CONF INTERNET PEOPLE AND ASSOCIATED SYMPOSIA/WORKSHOPS, 2015, :1078-1086