ASKALON: a tool set for cluster and Grid computing

被引:100
作者
Fahringer, T
Jugravu, A
Pllana, S
Prodan, R
Seragiotto, CJ
Truong, HL
机构
[1] Univ Innsbruck, Inst Comp Sci, A-6020 Innsbruck, Austria
[2] Univ Vienna, Inst Software Sci, A-1090 Vienna, Austria
关键词
cluster computing; Grid computing; parallel and distributed applications; performance prediction; measurement and analysis; bottleneck detection; experiment management;
D O I
10.1002/cpe.929
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Performance engineering of parallel and distributed applications is a complex task that iterates through various phases, ranging from modeling and prediction, to performance measurement, experiment management, data collection, and bottleneck analysis. There is no evidence so far that all of these phases should/can be integrated into a single monolithic tool. Moreover, the emergence of computational Grids as a common single wide-area platform for high-performance computing raises the idea to provide tools as interacting Grid services that share resources, support interoperability among different users and tools, and, most importantly, provide omnipresent services over the Grid. We have developed the ASKALON tool set to support performance-oriented development of parallel and distributed (Grid) applications. ASKALON comprises four tools, coherently integrated into a service-oriented architecture. SCALEA is a performance instrumentation, measurement, and analysis tool of parallel and distributed applications. ZENTURIO is a general purpose experiment management tool with advanced support for multi-experiment performance analysis and parameter studies. AKSUM provides semi-automatic highlevel performance bottleneck detection through a special-purpose performance property specification language. The PerformanceProphet enables the user to model and predict the performance of parallel applications at the early stages of development. In this paper we describe the overall architecture of the ASKALON tool set and outline the basic functionality of the four constituent tools. The structure of each tool is based on the composition and sharing of remote Grid services, thus enabling tool interoperability. In addition, a data repository allows the tools to share the common application performance and output data that have been derived by the individual tools. A service repository is used to store common portable Grid service implementations. A general-purpose Factory service is employed to create service instances on arbitrary remote Grid sites. Discovering and dynamically binding to existing remote services is achieved through registry services. The ASKALON visualization diagrams support both online and postmortem visualization of performance and output data. We demonstrate the usefulness and effectiveness of ASKALON by applying the tools to real-world applications. Copyright (C) 2005 John Wiley Sons, Ltd.
引用
收藏
页码:143 / 169
页数:27
相关论文
共 61 条
  • [31] GERNT M, 2002, EUR WORKSH PAR DISTR
  • [32] Grosso W., 2002, JAVA RMI
  • [33] Harold ER., 1998, XML EXTENSIBLE MARKU
  • [34] HERZOG R, POSTGRE SQL LINUX DA
  • [35] JOANNIDIS YE, 1996, P 22 INT C VER LARG, P274
  • [36] KARAVANIC KL, 1997, P SC97 C SAN JOS NOV
  • [37] KRASNER GE, 1988, J OBJECT-ORIENT PROG, V1, P41
  • [38] LINTHICUM DS, 1995, OPEN COMPUTING, V12, P68
  • [39] Litzkow M. J., 1988, 8th International Conference on Distributed Computing Systems (Cat. No.88CH2541-1), P104, DOI 10.1109/DCS.1988.12507
  • [40] Malony AD, 2000, KLUWER INT SER ENG C, V567, P37