Accelerating memory and I/O intensive HPC applications using hardware compression

被引:0
作者
AlSaleh, Saleh [1 ,2 ]
Elrabaa, Muhammad E. S. [1 ,2 ]
El-Maleh, Aiman [1 ,2 ]
Daud, Khaled [1 ,2 ]
Hroub, Ayman [3 ]
Mudawar, Muhamed [1 ,2 ]
Tonellot, Thierry [4 ]
机构
[1] King Fahd Univ Petr & Minerals, Comp Engn Dept, Dhahran, Saudi Arabia
[2] King Fahd Univ Petr & Minerals, Interdisciplinary Res Ctr Intelligent Secure Syst, Dhahran, Saudi Arabia
[3] Birzeit Univ, Dept Elect & Comp Engn, Ramallah, Palestine
[4] Saudi ARAMCO, EXPEC ARC Adv Res Ctr, Dhahran, Saudi Arabia
关键词
High performance computing; Reconfigurable computing; FPGA accelerators; Data compression; Memory intensive applications; Hardware co-design; LOSSLESS COMPRESSION;
D O I
10.1016/j.jpdc.2024.104955
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recently, accelerator-based compression/decompression was proposed to hide the storage latency of highperformance computing (HPC) applications that generate/ingest large data that cannot fit a node's memory. In this work, such a scheme has been implemented using a novel FPGA-based lossy compression/decompression scheme that has very low-latency. The proposed scheme completely overlaps the movement of the application's data with its compute kernels on the CPU with minimal impact on these kernels. Experiments showed that it can yield performance levels on-par with utilizing memory-only storage buffers, even though data is actually stored on disk. Experiments also showed that compared to CPU- and GPU-based compression frameworks, it achieves better performance levels at a fraction of the power consumption.
引用
收藏
页数:13
相关论文
共 32 条
[1]  
Abdelkhalek Rached, 2009, 2009 International Conference on High Performance Computing & Simulation (HPCS), P36, DOI 10.1109/HPCSIM.2009.5192786
[2]  
[Anonymous], 2023, Nvcomp
[3]  
Bartik M, 2019, 2019 IEEE 9TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), P732, DOI 10.1109/CCWC.2019.8666521
[4]  
Bartík M, 2015, IEEE I C ELECT CIRC, P179, DOI 10.1109/ICECS.2015.7440278
[5]  
Collet Y., Lz4-extremely fast compression
[6]   Fast Error-bounded Lossy HPC Data Compression with SZ [J].
Di, Sheng ;
Cappello, Franck .
2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, :730-739
[7]  
Duwe K., 2020, Supercomput. Front. Innov., V7
[8]   A Scalable High-Bandwidth Architecture for Lossless Compression on FPGAs [J].
Fowers, Jeremy ;
Kim, Joo-Young ;
Burger, Doug ;
Hauck, Scott .
2015 IEEE 23RD ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2015, :52-59
[9]  
Gailly J.-l., The gzip home page
[10]   DE-ZFP: An FPGA implementation of a modified ZFP compression/decompression algorithm [J].
Habboush, Mahmoud ;
El-Maleh, Aiman H. ;
Elrabaa, Muhammad E. S. ;
AlSaleh, Saleh .
MICROPROCESSORS AND MICROSYSTEMS, 2022, 90