Evaluation of collective I/O implementations on parallel architectures

被引:10
作者
Dickens, PM [1 ]
Thakur, R
机构
[1] IIT, Dept Comp Sci, Chicago, IL 60616 USA
[2] Argonne Natl Lab, Div Math & Comp Sci, Argonne, IL 60439 USA
关键词
D O I
10.1006/jpdc.2000.1733
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we evaluate the impact on performance of various implementation techniques for collective I/O operations, and we do so across four important parallel architectures. We show that a naive implementation of collective I/O does not result in significant performance gains for any of the architectures, but that an optimized implementation does provide excellent performance across till of the platforms under study. Furthermore, we demonstrate that there exists a single implementation strategy that provides the best performance for all four computational platforms. Next, we evaluate implementation techniques for thread-based collective I/O operations. We show that the most obvious implementation technique, which is to spawn a thread to execute the whole collective I/O operation in the background. frequently provides the worst performance, often performing much worse than just executing the collective I/O routine entirely in the foreground. To improve performance, we explore an alternate approach where part of the collective I/O operation is performed in the background, and part is performed in the foreground. We demonstrate that this implementation technique can provide significant performance gains, offering up to a 50% improvement over implementations that do not attempt to overlap collective I/O and computation. (C) 2001 Academic Press.
引用
收藏
页码:1052 / 1076
页数:25
相关论文
共 26 条
[1]  
Baylor S. J., 1996, Input/output and parallel and distributed computer systems, P167
[2]  
Bordawekar R., 1993, Proceedings SUPERCOMPUTING '93, P452, DOI 10.1145/169627.169782
[3]  
BORDAWEKAR R, 1998, TR157 CACR CALTECH
[4]  
BORDAWEKAR R, 1997, P 11 ACM INT C SUP A
[5]  
CHO Y, 1997, P 5 WORKSH INP OUTP
[6]  
CRANDALL P, 1995, P SUP 95 DEC
[7]  
DELROSARIO JM, 1993, P IPPS 93 WORKSH INP, P56
[8]  
DICKENS P, LECT NOTES COMPUTER, V1470, P959
[9]  
FEITELSON D, 1997, TOPICS MODERN OPERAT
[10]  
FEITELSON D, 1994, 19760 RC IBM WATS RE