Towards End-to-End Compression in Lustre

被引:0
作者
Fuchs, Anna [1 ]
Squar, Jannek [1 ]
Kuhn, Michael [2 ]
机构
[1] Univ Hamburg, Hamburg, Germany
[2] Otto von Guericke Univ, Magdeburg, Germany
来源
2024 23RD INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING, ISPDC 2024 | 2024年
关键词
Compression; Lustre; Parallel File System; Access Patterns; I/O;
D O I
10.1109/ISPDC62236.2024.10705396
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Scientific applications generate massive amounts of data, posing storage limitations and network traffic challenges. While scientists struggle with the usage of application-side compression and parallel I/O, we design a transparent feature. Our Lustre-based prototype automatically applies lossless compression, offering flexibility in compression-related decisions to minimize computational costs and optimize application performance. We outline the challenges posed by our prototype and illustrate through a comprehensive assessment how integrating Lustre and ZFS as backend solutions provides the essential elements for performance and scalability: specifically, asynchronous operations and parallel processing of compression. In our evaluation, we illustrate the interaction of different buffer levels within a distributed system. Additionally, we showcase how the I/O pattern, hardware setup, and various system software optimizations can impact overall performance and influence the choice of compression strategy.
引用
收藏
页数:8
相关论文
共 12 条
[1]   I/O Access Patterns in HPC Applications: A 360-Degree Survey [J].
Bez, Jean Luca ;
Byna, Suren ;
Ibrahim, Shadi .
ACM COMPUTING SURVEYS, 2024, 56 (02)
[2]  
Blagodarenko Artem, 2023, Client-side data compression
[3]  
Farrell Patrick, 2023, Lad 2023: Buffered i/o, dio & unaligned dio
[4]  
Farrell Patrick, 2017, Lustre on flash
[5]  
Kuhn Michael, 2016, [Supercomputing Frontiers and Innovations, Supercomputing Frontiers and Innovations], V3, P75
[6]  
Lockwood G. K., 2017, Storage 2020: A Vision for the Future of HPC Storage, V10
[7]   Idle waves in high-performance computing [J].
Markidis, Stefano ;
Vencels, Juris ;
Peng, Ivy Bo ;
Akhmetova, Dana ;
Laure, Erwin ;
Henri, Pierre .
PHYSICAL REVIEW E, 2015, 91 (01)
[8]  
Nelson Mark, 2022, An initial look at deep learning io performance
[9]  
Pourmal Elena, 2017, Hdf5 data compression demystified n2: Performance tuning
[10]  
Purandare Devashish R., 2022, Analysis and workload characterization of the CERN EOS storage system, P1