FPGA-based Learning Acceleration for LSTM Neural Network

被引:1
|
作者
Dec, Grzegorz Rafal [1 ]
机构
[1] Rzeszow Univ Technol, Dept Comp & Control Engn, W Pola 2, PL-35959 Rzeszow, Poland
关键词
Backpropagation through time; algorithms implemented in hardware; neural nets; reconfigurable hardware;
D O I
10.1142/S0129626423500019
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper presents and discusses the implementation of a learning accelerator for an LSTM neural network that utilizes an FPGA. The accelerator consists of a backpropagation through time algorithm for an LSTM. The presented net performs a binary classification task and consists of an LSTM and a dense layer. The performance is then compared to both a hard-coded Python implementation and an implementation using Keras library and the GPU. The implementation is executed using the DSP blocks, available via the Vivado Design Suite, which is in compliance with the IEEE754 standard. The results of the simulation show that the FPGA implementation remains accurate and achieves higher speed than the other solutions.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] FHAST: FPGA-Based Acceleration of BOWTIE in Hardware
    Fernandez, Edward B.
    Villarreal, Jason
    Lonardi, Stefano
    Najjar, Walid A.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (05) : 973 - 981
  • [2] FPGA-based Neural Net for Failures Prediction in the Cold Forging Process
    Dec, Grzegorz Rafal
    PARALLEL PROCESSING LETTERS, 2022, 32 (01N02)
  • [3] On efficient implementation of FPGA-based hyperelliptic curve cryptosystems
    Elias, Grace
    Miri, Ali
    Yeap, Tet-Hin
    COMPUTERS & ELECTRICAL ENGINEERING, 2007, 33 (5-6) : 349 - 366
  • [4] Pruning Binarized Neural Networks Enables Low-Latency, Low-Power FPGA-Based Handwritten Digit Classification
    Payra, Syamantak
    Loke, Gabriel
    Fink, Yoel
    Steinmeyer, Joseph D.
    2023 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE, HPEC, 2023,
  • [5] An FPGA-Based Hardware Implementation of Visual based Fall Detection
    Ong, Peng Shen
    Ooi, Chee Pun
    Chang, Yoong Choon
    Karuppiah, Ettikan K.
    Tahir, Shahirina Mohd
    2014 IEEE REGION 10 SYMPOSIUM, 2014, : 397 - 402
  • [6] FPGA-Based Real-Time Implementation of Detection Algorithm for Automatic Traffic Surveillance Sensor Network
    Wojcikowski, Marek
    Zaglewski, Robert
    Pankiewicz, Bogdan
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2012, 68 (01): : 1 - 18
  • [7] FPGA-Based Real-Time Implementation of Detection Algorithm for Automatic Traffic Surveillance Sensor Network
    Marek Wójcikowski
    Robert Żaglewski
    Bogdan Pankiewicz
    Journal of Signal Processing Systems, 2012, 68 : 1 - 18
  • [8] Effects of Runtime Reconfiguration on PUFs Implemented as FPGA-Based Accelerators
    Nassar, Hassan
    Bauer, Lars
    Henkel, Joerg
    IEEE EMBEDDED SYSTEMS LETTERS, 2023, 15 (04) : 174 - 177
  • [9] High level performance metrics for FPGA-based multiprocessor systems
    Beltran, Marta
    Guzman, Antonio
    Sevillano, Fernando
    PERFORMANCE EVALUATION, 2010, 67 (06) : 417 - 431
  • [10] Reconfigurable FPGA-Based FFT Processor for Cognitive Radio Applications
    Ferreira, Mario Lopes
    Barahimi, Amin
    Ferreira, Joao Canas
    APPLIED RECONFIGURABLE COMPUTING, ARC 2016, 2016, : 223 - 232