FPGA HPC using OpenCL: Case Study in 3D FFT

被引:10
|
作者
Sanaullah, Ahmed [1 ]
Herbordt, Martin C. [1 ]
机构
[1] Boston Univ, Dept Elect & Comp Engn, Boston, MA 02215 USA
基金
美国国家科学基金会;
关键词
ACCELERATION;
D O I
10.1145/3241793.3241800
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
FPGAs have typically achieved high speedups for 3D Fast Fourier Transforms (FFTs) due to the presence of hard floating point units, low latency specialized pipelines, and support for complex connectivity among processing elements. Previous implementations have relied on FFT IP cores for performing the computation due to the complexity of manually developing and maintaining/upgrading efficient pipelines in HDL. These IP cores, however, are bulky and cannot be fully tuned for specific FFT sizes due to use of generic architectures. HLS tools, such as OpenCL, offer a more customizable alternative but have suffered from worse performance than HDL in previous work. In this paper we show that, using a set of code structure optimizations, OpenCL designs can be compiled to Radix-2 FFT pipelines which outperform IP core based designs for the same throughput. We further show that the HDL generated by the OpenCL compiler can be isolated and seamlessly integrated into existing 3D FFT shells to reduce implementation effort. Our single device design, tested on the Altera Arria10X115 FPGA, achieves an average speedup of 29x vs CPU-MKL, 4.1x vs GPU cuFFT and 1.1x vs IP Core FFT implementations for 16(3), 32(3) and 64(3) FFTs. Moreover, OpenCL generated compute pipelines for 8(3), 16(3), 32(3) and 64(3) FFTs use an average of 7.5x fewer ALMs and 1.6x fewer DSPs than corresponding IP core versions.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Evaluating an OpenCL FPGA Platform for HPC: a Case Study with the HACCmk Kernel
    Jin, Zheming
    Finkel, Hal
    2018 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2018,
  • [2] OpenCL for HPC with FPGAs: Case Study in Molecular Electrostatics
    Yang, Chen
    Sheng, Jiayi
    Patel, Rushi
    Sanaullah, Ahmed
    Sachdeva, Vipin
    Herbordt, Martin C.
    2017 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2017,
  • [3] FPGA Architecture for 3-D FDTD Acceleration Using OpenCL
    Waidyasooriya, H. M.
    Hariyama, M.
    Ohtera, Y.
    2016 PROGRESS IN ELECTROMAGNETICS RESEARCH SYMPOSIUM (PIERS), 2016, : 4719 - 4719
  • [4] HPC on FPGA Clouds: 3D FFTs and Implications for Molecular Dynamics
    Sheng, Jiayi
    Yang, Chen
    Sanaullah, Ahmed
    Papamichael, Michael
    Caulfield, Adrian
    Herbordt, Martin C.
    2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2017,
  • [5] Accelerating the 3-D FFT Using a Heterogeneous FPGA Architecture
    Anderson, Matthew
    Brodowicz, Maciej
    Swany, Martin
    Sterling, Thomas
    EURO-PAR 2017: PARALLEL PROCESSING WORKSHOPS, 2018, 10659 : 653 - 663
  • [6] Velocity Filtering Using Quantum 3D FFT
    Koukiou, Georgia
    Anastassopoulos, Vassilis
    PHOTONICS, 2023, 10 (05)
  • [7] 3D FFT for FPGAs
    Humphries, Ben
    Herbordt, Martin C.
    2013 IEEE CONFERENCE ON HIGH PERFORMANCE EXTREME COMPUTING (HPEC), 2013,
  • [8] 3D Skeleton Extraction Method using Potential Field on OpenCL
    Lu, Lu
    Wang, Xuewen
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND SERVICE SYSTEM (CSSS), 2014, 109 : 305 - 309
  • [9] Nuclear Reactor Simulation on OpenCL FPGA : a Case Study of RSBench
    Jin, Zheming
    Finkel, Hal
    IWOCL'18: PROCEEDINGS OF THE INTERNATIONAL WORKSHOP ON OPENCL, 2018, : 9 - 17
  • [10] Improving 3D printing of garments by using HPC Cloud
    Tomic, D.
    Davidovic, D.
    Subic, T.
    Mesaric, J.
    Skala, K.
    2020 43RD INTERNATIONAL CONVENTION ON INFORMATION, COMMUNICATION AND ELECTRONIC TECHNOLOGY (MIPRO 2020), 2020, : 233 - 236