Generating High-Performance FPGA Accelerator Designs for Big Data Analytics with Fletcher and Apache Arrow

被引:1
|
作者
Peltenburg, Johan [1 ]
van Straten, Jeroen [1 ]
Brobbel, Matthijs [1 ]
Al-Ars, Zaid [1 ]
Hofstee, H. Peter [1 ,2 ]
机构
[1] Delft Univ Technol, Delft, Netherlands
[2] IBM Corp, Austin, TX USA
来源
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY | 2021年 / 93卷 / 05期
关键词
FPGA; Accelerator; Big data; Analytics; Fletcher; Apache Arrow;
D O I
10.1007/s11265-021-01650-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As big data analytics systems are squeezing out the last bits of performance of CPUs and GPUs, the next near-term and widely available alternative industry is considering for higher performance in the data center and cloud is the FPGA accelerator. We discuss several challenges a developer has to face when designing and integrating FPGA accelerators for big data analytics pipelines. On the software side, we observe complex run-time systems, hardware-unfriendly in-memory layouts of data sets, and (de)serialization overhead. On the hardware side, we observe a relative lack of platform-agnostic open-source tooling, a high design effort for data structure-specific interfaces, and a high design effort for infrastructure. The open source Fletcher framework addresses these challenges. It is built on top of Apache Arrow, which provides a common, hardware-friendly in-memory format to allow zero-copy communication of large tabular data, preventing (de)serialization overhead. Fletcher adds FPGA accelerators to the list of over eleven supported software languages. To deal with the hardware challenges, we present Arrow-specific components, providing easy-to-use, high-performance interfaces to accelerated kernels. The components are combined based on a generic architecture that is specialized according to the application through an extensive infrastructure generation framework that is presented in this article. All generated hardware is vendor-agnostic, and software drivers add a platform-agnostic layer, allowing users to create portable implementations.
引用
收藏
页码:565 / 586
页数:22
相关论文
共 50 条
  • [41] Big Data Benchmarks of High-Performance Storage Systems on Commercial Bare Metal Clouds
    Lee, Hyungro
    Fox, Geoffrey C.
    2019 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2019), 2019, : 1 - 8
  • [42] High-Performance Spatial Query Processing on Big Taxi Trip Data using GPGPUs
    Zhang, Jianting
    You, Simin
    Gruenwald, Le
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 72 - 79
  • [43] High-performance Decoding of Variable-length Memory Data Packets for FPGA Stream Processing
    Sierra, Roberto
    Mangani, Filippo
    Carreras, Carlos
    Caffarena, Gabriel
    2019 29TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2019, : 307 - 313
  • [44] Design and Development of FPGA-based High-Performance Radar Data Stream Mining System
    Liu, Ying
    Ma, Pengshan
    Cui, Hongyuan
    3RD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, ITQM 2015, 2015, 55 : 876 - 885
  • [45] High-Performance FPGA Streaming Data Concentrator for GEM Electronic Measurement System for WEST Tokamak
    Kolasinski, Piotr
    Pozniak, Krzysztof T.
    Wojenski, Andrzej
    Linczuk, Pawel
    Kasprowicz, Grzegorz
    Chernyshova, Maryna
    Mazon, Didier
    Czarski, Tomasz
    Colnel, Julian
    Malinowski, Karol
    Guibert, Denis
    ELECTRONICS, 2023, 12 (17)
  • [46] EVOLVE: Towards Converging Big-Data, High-Performance and Cloud-Computing Worlds
    Tzenetopoulos, Achilleas
    Masouros, Dimosthenis
    Koliogeorgi, Konstantina
    Xydis, Sotirios
    Soudris, Dimitrios
    Chazapis, Antony
    Kozanitis, Christos
    Bilas, Angelos
    Pinto, Christian
    Huy-Nam Nguyen
    Louloudakis, Stelios
    Gardikis, Georgios
    Vamvakas, George
    Aubrun, Michelle
    Symeonidou, Christy
    Spitadakis, Vassilis
    Xylogiannopoulos, Konstantinos
    Peischl, Bernhard
    Kalayci, Tahir Emre
    Stocker, Alexander
    Acquaviva, Jean-Thomas
    PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 975 - 980
  • [47] Big Data solutions on a small scale: Evaluating accessible high-performance computing for social research
    Murthy, Dhiraj
    Bowman, Sawyer A.
    BIG DATA & SOCIETY, 2014, 1 (02):
  • [48] Blockchain-Enabled Intelligent IoT Protocol for High-Performance and Secured Big Financial Data Transaction
    Saba, Tanzila
    Haseeb, Khalid
    Rehman, Amjad
    Jeon, Gwanggil
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 1667 - 1674
  • [50] Emulation of high-performance correlation-based quantum clustering algorithm for two-dimensional data on FPGA
    Talal Bonny
    A. Haq
    Quantum Information Processing, 2020, 19