FPGA HPC using OpenCL: Case Study in 3D FFT

被引:10
|
作者
Sanaullah, Ahmed [1 ]
Herbordt, Martin C. [1 ]
机构
[1] Boston Univ, Dept Elect & Comp Engn, Boston, MA 02215 USA
基金
美国国家科学基金会;
关键词
ACCELERATION;
D O I
10.1145/3241793.3241800
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
FPGAs have typically achieved high speedups for 3D Fast Fourier Transforms (FFTs) due to the presence of hard floating point units, low latency specialized pipelines, and support for complex connectivity among processing elements. Previous implementations have relied on FFT IP cores for performing the computation due to the complexity of manually developing and maintaining/upgrading efficient pipelines in HDL. These IP cores, however, are bulky and cannot be fully tuned for specific FFT sizes due to use of generic architectures. HLS tools, such as OpenCL, offer a more customizable alternative but have suffered from worse performance than HDL in previous work. In this paper we show that, using a set of code structure optimizations, OpenCL designs can be compiled to Radix-2 FFT pipelines which outperform IP core based designs for the same throughput. We further show that the HDL generated by the OpenCL compiler can be isolated and seamlessly integrated into existing 3D FFT shells to reduce implementation effort. Our single device design, tested on the Altera Arria10X115 FPGA, achieves an average speedup of 29x vs CPU-MKL, 4.1x vs GPU cuFFT and 1.1x vs IP Core FFT implementations for 16(3), 32(3) and 64(3) FFTs. Moreover, OpenCL generated compute pipelines for 8(3), 16(3), 32(3) and 64(3) FFTs use an average of 7.5x fewer ALMs and 1.6x fewer DSPs than corresponding IP core versions.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] 3D Tomography Back-Projection Parallelization on Intel FPGAs Using OpenCL
    Maxime Martelli
    Nicolas Gac
    Alain Mérigot
    Cyrille Enderli
    Journal of Signal Processing Systems, 2019, 91 : 731 - 743
  • [22] On-chip Memory Efficient Data Layout for 2D FFT on 3D Memory Integrated FPGA
    Singapura, Shreyas G.
    Kannan, Rajgopal
    Prasanna, Viktor K.
    2016 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2016,
  • [23] A case study: Undergraduate self-learning in HPC including OpenMP, MPI, OpenCL, and FPGAs
    Jamieson, Peter
    Herbordt, Martin
    Kinsy, Michel
    2019 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2019), 2019, : 782 - 787
  • [24] Optimizing Complex OpenCL Code for FPGA: A Case Study on Finite Automata Traversal
    Nourian, Marziyeh
    Zarch, Mostafa Eghbali
    Becchi, Michela
    2020 IEEE 26TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2020, : 518 - 527
  • [25] Evaluating Irregular Memory Access on OpenCL FPGA Platforms: a Case Study with XSBench
    Luo, Yingyi
    Wen, Xianshan
    Yoshii, Kazutomo
    Ogrenci-Memik, Seda
    Memik, Gokhan
    Finkel, Hal
    Cappello, Franck
    2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2017,
  • [26] On the GPU Performance of 3D Stencil Computations Implemented in OpenCL
    Su, Huayou
    Wu, Nan
    Wen, Mei
    Zhang, Chunyuan
    Cai, Xing
    SUPERCOMPUTING (ISC 2013), 2013, 7905 : 125 - 135
  • [27] A Design of Stereoscopic 3D Video Processing System Based on FPGA 3D Formatter in case of FPR
    Sokullu, Radosveta
    Aydin, Mutlu
    2013 INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS (SITIS), 2013, : 111 - 117
  • [28] 2D Stencil Computation on Cyclone V SoC FPGA using OpenCL
    Firmansyah, Iman
    Wijayanto, Yusuf Nur
    Yamaguchi, Yoshiki
    2018 INTERNATIONAL CONFERENCE ON RADAR, ANTENNA, MICROWAVE, ELECTRONICS, AND TELECOMMUNICATIONS (ICRAMET), 2018, : 121 - 124
  • [29] Protein–ligand docking using FFT based sampling: D3R case study
    Dzmitry Padhorny
    David R. Hall
    Hanieh Mirzaei
    Artem B. Mamonov
    Mohammad Moghadasi
    Andrey Alekseenko
    Dmitri Beglov
    Dima Kozakov
    Journal of Computer-Aided Molecular Design, 2018, 32 : 225 - 230
  • [30] An OpenCL-based speckle matching on the monocular 3D sensor using speckle projection
    Yin, Wei
    Zuo, Chao
    Feng, Shijie
    Tao, Tianyang
    Chen, Qian
    FOURTH INTERNATIONAL CONFERENCE ON PHOTONICS AND OPTICAL ENGINEERING, 2021, 11761