The Internet Operating System: Middleware for adaptive distributed computing

被引:32
作者
El Maghraoui, Kaoutar [1 ]
Desell, Travis J. [1 ]
Szymanski, Boleslaw K. [1 ]
Varela, Carlos A. [1 ]
机构
[1] Rensselaer Polytech Inst, Dept Comp Sci, Troy, NY 12180 USA
关键词
grid computing; middleware; resource management; dynamic reconfiguration;
D O I
10.1177/1094342006068411
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Large-scale, dynamic, and heterogeneous networks of computational resources (a.k.a. grids) promise to provide high performance and scalability to computationally intensive applications. To fulfill this promise, grid environments require complex resource management. We propose decentralized middleware-triggered dynamic reconfiguration strategies to enable application adaptation to the constantly changing resource availability of Internet-scale shared computational grids. As a proof of concept, we present a software framework for dynamically reconfigurable distributed applications. The Internet Operating System (IOS) is a middleware infrastructure which aims at freeing application developers from dealing with non-functional concerns while seeking to optimize application performance and global resource utilization. IOS consists of distributed middleware agents that are capable of interconnecting themselves in various virtual peer-to-peer topologies. IOS middleware agents: 1) profile application communication patterns; 2) evaluate the dynamics of the underlying physical resources; and 3) reconfigure application components by changing their mappings to physical resources through migration and by changing their granularity through a split and merge mechanism. A key characteristic of IOS is its decentralized coordination, thereby avoiding the use of global knowledge and thus enabling scalable reconfiguration. The IOS middleware is programming model-independent: we have implemented an actor programming model interface for SALSA programs and also a process programming model interface for MPI programs. Experimental results show that adaptive middleware can be an effective approach to reconfiguring distributed applications with various ratios of communication to computation in order to improve their performance, and more effectively utilize grid resources.
引用
收藏
页码:467 / 480
页数:14
相关论文
共 35 条
[1]  
AGHA G, 2004, PRACTICAL HDB INTERN
[2]  
Agha Gul, 1986, Actors: A Model of Concurrent Computation in Distributed Systems
[3]   The cactus worm: Experiments with dynamic resource discovery and allocation in a grid environment [J].
Allen, G ;
Angulo, D ;
Foster, I ;
Lanfermann, G ;
Liu, C ;
Radke, T ;
Seidel, E ;
Shalf, J .
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2001, 15 (04) :345-358
[4]  
ANGULO D, 2002, IEEE INT S HIGH PERF
[5]  
ANTONIU G, 2006, IN PRESS CONCURRENCY, V17
[6]   The GrADS project: Software support for high-level grid application development [J].
Berman, F ;
Chien, A ;
Cooper, K ;
Dongarra, J ;
Foster, I ;
Gannon, D ;
Johnsson, L ;
Kennedy, K ;
Kesselman, C ;
Mellor-Crummey, J ;
Reed, D ;
Torczon, L ;
Wolski, R .
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2001, 15 (04) :327-344
[7]   Scheduling multithreaded computations by work stealing [J].
Blumofe, RD ;
Leiserson, CE .
JOURNAL OF THE ACM, 1999, 46 (05) :720-748
[8]  
BOUTEILLER A, 2003, SUPERCOMPUTING 2003
[9]  
CZAJKOWSKI K, 2001, 10 IEEE INT S HIGH P
[10]   LOAD BALANCING FOR THE PARALLEL ADAPTIVE SOLUTION OF PARTIAL-DIFFERENTIAL EQUATIONS [J].
DECOUGNY, HL ;
DEVINE, KD ;
FLAHERTY, JE ;
LOY, RM ;
OZTURAN, C ;
SHEPHARD, MS .
APPLIED NUMERICAL MATHEMATICS, 1994, 16 (1-2) :157-182