Towards autonomic data management for staging-based coupled scientific workflows

被引:5
作者
Jin T. [1 ]
Zhang F. [1 ]
Sun Q. [1 ]
Romanus M. [1 ]
Bui H. [1 ]
Parashar M. [1 ]
机构
[1] Rutgers Discovery Informatics Institute, Piscataway, 08854, NJ
基金
美国国家科学基金会;
关键词
Autonomic computing; Data management; Data staging; HPC workflow; In-situ;
D O I
10.1016/j.jpdc.2020.07.002
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Emerging scientific workflows running at extreme scale are composed of multiple applications that interact and exchange data at runtime. While staging-based approaches, e.g. in-situ/in-transit processing, are promising, dynamic behaviors (e.g. data volumes and distributions) in coupled applications and varying resource constraints at runtime make the efficient use of these techniques challenging. Addressing these challenges requires fundamental changes in the way that workflows are executed at runtime. Specifically, it is required to monitor the operating environment and running applications, and then adapt and tune the application behaviors and resource allocations at runtime while meeting the data management requirements and constraints. In this paper, we propose a policy-based autonomic data management (ADM) approach that can adaptively respond at runtime to dynamic data management requirements. We first formulate the schematic abstraction of this ADM approach including its conceptual model and system elements. Then, we explore the realization of ADM runtime and demonstrate how to achieve adaptations in a cross-layer manner with pre-defined autonomic policies. We also prototype our ADM approach and evaluate its performance on the Intrepid IBM-BlueGene and Titan Cray-XK7 systems using Chombo-based AMR applications and a visualization application. The experimental results demonstrate its effectiveness in meeting user defined objectives and accelerating overall scientific discovery. © 2020 Elsevier Inc.
引用
收藏
页码:35 / 51
页数:16
相关论文
共 46 条
[1]  
Abbasi H., Wolf M., Eisenhauer G., Klasky S., Schwan K., Zheng F., Datastager: scalable data staging services for petascale applications, (2009)
[2]  
Abdelzaher T., Shin K., Bhatti N., Performance guarantees for web server end-systems: a control-theoretical approach, IEEE Trans. Parallel Distrib. Syst., 13, 1, pp. 80-96, (2002)
[3]  
Amer M.A., Chervenak A., Chen W., Improving scientific workflow performance using policy based data placement, (2012)
[4]  
Ayachit U., Bauer A., Duque E.P.N., Eisenhauer G., Ferrier N., Gu J., Jansen K.E., Loring B., Lukic Z., Menon S., Morozov D., O'Leary P., Ranjan R., Rasquin M., Stone C.P., Vishwanath V., Weber G.H., Whitlock B., Wolf M., Wu K.J., Bethel E.W., Performance analysis, design considerations, and applications of extreme-scale in situ infrastructures, (2016)
[5]  
Bennett J.C., Abbasi H., Bremer P.-T., Grout R.W., Gyulassy A., Jin T., Klasky S., Kolla H., Parashar M., Pascucci V., Pebay P., Thompson D., Yu H., Zhang F., Chen J., Combining in-situ and in-transit processing to enable extreme-scale scientific analysis, (2012)
[6]  
Chervenak A.L., Smith D.E., Chen W., Deelman E., Integrating policy with scientific workflow management for data-intensive applications, (2012)
[7]  
Docan C., Parashar M., Cummings J., Klasky S., Moving the code to the data - dynamic code deployment using activespaces, (2011)
[8]  
Docan C., Parashar M., Klasky S., DART: A substrate for high speed asynchronous data IO, (2008)
[9]  
Docan C., Parashar M., Klasky S., DataSpaces: An interaction and coordination framework for coupled simulation workflows, (2010)
[10]  
Docan C., Parashar M., Klasky S., Dataspaces: an interaction and coordination framework forcoupled simulation workflows, Cluster Comput., (2012)