Design and Evaluation of Multiple-Level Data Staging for Blue Gene Systems

被引:12
作者
Isaila, Florin [1 ]
Blas, Javier Garcia [1 ]
Carretero, Jesus [1 ]
Latham, Robert [2 ]
Ross, Robert [2 ]
机构
[1] Univ Carlos III Madrid, Leganes 28911, Madrid, Spain
[2] Argonne Natl Lab, Argonne, IL 60439 USA
基金
美国国家科学基金会;
关键词
MPI-IO; parallel I/O; parallel file systems; supercomputers; I/O;
D O I
10.1109/TPDS.2010.127
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Parallel applications currently suffer from a significant imbalance between computational power and available I/O bandwidth. Additionally, the hierarchical organization of current Petascale systems contributes to an increase of the I/O subsystem latency. In these hierarchies, file access involves pipelining data through several networks with incremental latencies and higher probability of congestion. Future Exascale systems are likely to share this trait. This paper presents a scalable parallel I/O software system designed to transparently hide the latency of file system accesses to applications on these platforms. Our solution takes advantage of the hierarchy of networks involved in file accesses, to maximize the degree of overlap between computation, file I/O-related communication, and file system access. We describe and evaluate a two-level hierarchy for Blue Gene systems consisting of client-side and I/O node-side caching. Our file cache management modules coordinate the data staging between application and storage through the Blue Gene networks. The experimental results demonstrate that our architecture achieves significant performance improvements through a high degree of overlap between computation, communication, and file I/O.
引用
收藏
页码:946 / 959
页数:14
相关论文
共 35 条
[31]  
Shan H., 2008, PROC 2008 ACM IEEE C, P1
[32]   Data sieving and collective I/O in ROMIO [J].
Thakur, R ;
Gropp, W ;
Lusk, E .
FRONTIERS '99 - THE SEVENTH SYMPOSIUM ON THE FRONTIERS OF MASSIVELY PARALLEL COMPUTATION, PROCEEDINGS, 1999, :182-189
[33]   PARALLEL SCRIPTING FOR APPLICATIONS AT THE PETASCALE AND BEYOND [J].
Wilde, Michael ;
Foster, Ian ;
Iskra, Kamil ;
Beckman, Pete ;
Zhang, Zhao ;
Espinosa, Allan ;
Hategan, Mihael ;
Clifford, Ben ;
Raicu, Ioan .
COMPUTER, 2009, 42 (11) :50-60
[34]  
Wong P., 2003, NAS PARALLEL BENCHMA
[35]   OPAL: An open-source MPI-IO library over Cray XT [J].
Yu, Weikuan ;
Vetter, Jeffrey S. ;
Canon, R. Shane .
SNAPI 2007: FOURTH INTERNATIONAL WORKSHOP ON STORAGE NETWORK ARCHITECTURE AND PARALLEL I/OS, PROCEEDINGS, 2007, :41-+