Investigation Of Leading HPC I/O Performance Using A Scientific-Application Derived Benchmark

Cited by: 0
Authors
Borrill, Julian [1]
Oliker, Leonid [1]
Shalf, John [1]
Shan, Hongzhang [1]
Affiliations
[1] Univ Calif Berkeley, Lawrence Berkeley Lab, CRD NERSC, Berkeley, CA 94720 USA
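Source
2007 ACM/IEEE SC07 CONFERENCE | 2010
Keywords
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract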
With the exponential growth of high-fidelity sensor and simulated data, the scientific community is increasingly reliant on ultrascale HPC resources to handle its data analysis requirements. However, to utilize such extreme computing power effectively, the I/O components must be designed in a balanced fashion, as any architectural bottleneck will quickly render the platform intolerably inefficient. To understand the I/O performance of data-intensive applications in realistic computational settings, we develop a lightweight, portable benchmark called MADbench2, which is derived directly from a large-scale Cosmic Microwave Background (CMB) data analysis package. Our study represents one of the most comprehensive I/O analyses of modern parallel filesystems, examining a broad range of system architectures and configurations, including Lustre on the Cray XT3 and Intel Itanium2 cluster; GPFS on IBM Power5 and AMD Opteron platforms; two BlueGene/L installations utilizing GPFS and PVFS2 filesystems; and CXFS on the SGI Altix3700. We present extensive synchronous I/O performance data comparing a number of key parameters, including concurrency, POSIX versus MPI-IO, and unique-file versus shared-file access, using both the default environment and highly tuned I/O parameters. Finally, we explore the potential of asynchronous I/O and quantify the volume of computation required to hide a given volume of I/O. Overall, our study quantifies the vast differences in performance and functionality of parallel filesystems across state-of-the-art platforms, while providing system designers and computational scientists with a lightweight tool for conducting further analyses.
Pages: 488-499
Page count: 12