Phoenix: a parallel programming model for accommodating dynamically joining/leaving resources

被引:0
作者
Taura, K
Kaneda, K
Endo, T
Yonezawa, A
机构
[1] Univ Tokyo, Bunkyo Ku, Tokyo 1130033, Japan
[2] PRESTO, JST, Kawaguchi, Saitama 3320012, Japan
关键词
performance; parallel programming; distributed programming; message passing; migration; resource reconfiguration;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper proposes Phoenix, a programming model for writing parallel and distributed applications that accommodate dynamically joining/leaving compute resources. In the proposed model, nodes involved in an application see a large and fixed virtual node name space. They communicate via messages, whose destinations are specified by virtual node names, rather than names bound to a physical resource. We describe Phoenix API and show how it allows a transparent migration of application states, as well as dynamically joining/leaving nodes as its by-product. We also demonstrate through several application studies that Phoenix model is close enough to regular message passing, thus it is a general programming model that facilitates porting many parallel applications/algorithms to more dynamic environments. Experimental results indicate applications that have a small task migration cost can quickly take advantage of dynamically joining resources using Phoenix. Divide-and-conquer algorithms written in Phoenix achieved a good speedup with a large number of nodes across multiple LANs (120 times speedup using 169 CPUs across three LANs). We believe Phoenix provides a useful programming abstraction and platform for emerging parallel applications that must be deployed across multiple LANs and/or shared clusters having dynamically varying resource conditions.
引用
收藏
页码:215 / 228
页数:14
相关论文
共 33 条
[1]  
BEGUELIN A, 1994, PVM PARALLEL VIRTUAL
[2]   Scheduling multithreaded computations by work stealing [J].
Blumofe, RD ;
Leiserson, CE .
JOURNAL OF THE ACM, 1999, 46 (05) :720-748
[3]  
BOSILCA G, 2002, SC 2002
[4]  
BUYYA R, 2000, HIGH PERFORMANCE COM, P283
[5]  
Cheng C., 1989, ACM SIGCOMM 89, V19-4, P224
[6]  
Fagg GE, 2000, LECT NOTES COMPUT SC, V1908, P346
[7]  
FEELEY M, 1993, LECT NOTES COMPUTER, V748, P94
[8]  
FOSTER I, 1998, SC 1998
[9]  
Foster I, 1999, GRID BLUEPRINT NEW C
[10]  
FRIGO M, 1998, ACM PLDI