Design Optimization for High-Performance Computing Using FPGA

被引:0
|
作者
Isik, Murat [1 ]
Inadagbo, Kayode [2 ]
Aktas, Hakan [3 ]
机构
[1] Drexel Univ, Elect & Comp Engn Dept, Philadelphia, PA 19104 USA
[2] A&M Univ, Elect & Comp Engn Dept, Prairie View, TX USA
[3] Omer Halisdemir Univ, Comp Engn Dept, Nigde, Turkiye
来源
INFORMATION MANAGEMENT AND BIG DATA, SIMBIG 2023 | 2024年 / 2142卷
关键词
High-performance computing; Tensil AI; Design optimization; FPGA; Open-source inference accelerator;
D O I
10.1007/978-3-031-63616-5_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reconfigurable architectures like Field Programmable Gate Arrays (FPGAs) have been used for accelerating computations in several domains because of their unique combination of flexibility, performance, and power efficiency. However, FPGAs have not been widely used for high-performance computing, primarily because of their programming complexity and difficulties in optimizing performance. We optimize Tensil AI's open-source inference accelerator for maximum performance using ResNet20 trained on CIFAR in this paper in order to gain insight into the use of FPGAs for high-performance computing. In this paper, we show how improving hardware design, using Xilinx Ultra RAM, and using advanced compiler strategies can lead to improved inference performance. We also demonstrate that running the CIFAR test data set shows very little accuracy drop when rounding down from the original 32bit floating point. The heterogeneous computing model in our platform allows us to achieve a frame rate of 293.58 frames per second (FPS) and a %90 accuracy on a ResNet20 trained using CIFAR. The experimental results show that the proposed accelerator achieves a throughput of 21.12 Giga-Operations Per Second (GOP/s) with a 5.21W on-chip power consumption at 100 MHz. The comparison results with off-the-shelf devices and recent state-of-the-art implementations illustrate that the proposed accelerator has obvious advantages in terms of energy efficiency.
引用
收藏
页码:142 / 156
页数:15
相关论文
共 50 条
  • [1] Progressive collapse design optimization of RC frame structures using high-performance computing
    Lin, Kaiqi
    Wu, Zewei
    Zhu, Yaqiong
    Zheng, Junhao
    Li, Yi
    Lu, Xinzheng
    STRUCTURES, 2023, 50 : 823 - 834
  • [2] ADD: Accelerator Design and Deploy - A tool for FPGA high-performance dataflow computing
    Penha, Jeronimo C.
    Silva, Lucas B.
    Silva, Jansen M.
    Coelho, Kristtopher K.
    Baranda, Hector P.
    Nacif, Jose Augusto M.
    Ferreira, Ricardo S.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (18):
  • [3] Design and Performance Measurement of a High-Performance Computing Cluster
    George, Kiran
    Venugopal, Vivek
    2012 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC), 2012, : 2531 - 2536
  • [4] Educational Design of High-Performance Arithmetic Circuits on FPGA
    Cappuccino, G
    Cappuccino, G
    Corsonello, P
    Perri, S
    IEEE TRANSACTIONS ON EDUCATION, 1999, 42 (04) : 366 - 366
  • [5] Computing infrastructure construction and optimization for high-performance computing and artificial intelligence
    Su, Yun
    Zhou, Jipeng
    Ying, Jiangyong
    Zhou, Mingyao
    Zhou, Bin
    CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING, 2021, 3 (04) : 331 - 343
  • [6] Computing infrastructure construction and optimization for high-performance computing and artificial intelligence
    Yun Su
    Jipeng Zhou
    Jiangyong Ying
    Mingyao Zhou
    Bin Zhou
    CCF Transactions on High Performance Computing, 2021, 3 : 331 - 343
  • [7] Evaluation and optimization of high-performance computing and networking systems
    Min, Geyong
    Ould-Khaoua, Mohamed
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2007, 10 (02): : 111 - 113
  • [8] Modular High-Performance Computing Using Chiplets
    Vinnakota, Bapi
    Shalf, John M.
    COMPUTING IN SCIENCE & ENGINEERING, 2023, 25 (06) : 39 - 48
  • [9] High-performance computing in accelerating structure design and analysis
    Li, ZH
    Folwell, N
    Ge, LX
    Guetz, A
    Ivanov, V
    Kowalski, M
    Lee, LQ
    Ng, CK
    Schussman, G
    Stingelin, L
    Uplenchwar, R
    Wolf, M
    Xiao, LL
    Ko, K
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2006, 558 (01): : 168 - 174
  • [10] Design and Implementation of High-Performance Space Router Based on FPGA
    Zhou, Dong
    Shen, Xiaohu
    Li, Ke
    Feng, Guoping
    Wang, Luyuan
    2019 IEEE 11TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2019), 2019, : 704 - 708