Dynamic DAG scheduling for many-task computing of distributed eco-hydrological model

被引:5
作者
Yue, Shasha [1 ,2 ]
Ma, Yan [1 ]
Chen, Lajiao [1 ]
Wang, Yuzhu [1 ]
Song, Weijing [3 ]
机构
[1] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, Beijing 100094, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Hubei, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Hydrological model; Parallel computing; DAG scheduling; Many-task computing; PARALLELIZATION; DUPLICATION; ALGORITHM; OPTIMIZATION; COMPUTATION; SIMULATION; GRAPHS;
D O I
10.1007/s11227-017-2047-1
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The computing of distributed hydrological model at large scale is increasingly characterized by data intensive and computation intensive, especially for the multi-process coupling model. Parallel computing is one effective approach to cope with this situation. The easily extensible fine-grained parallelization method can substantially improve the computing efficiency. Based on many-task computing, we proposed a parallel scheme that the whole computing of the distributed hydrological model is split into tremendous amount of small sub-tasks which are directly dispatched into the cluster nodes by the traditional local resource managers (LRMs). The task-splitting method, the single task model and the representation of dependencies between tasks are also proposed. In order to efficiently schedule so many tasks, a dynamic DAG scheduling method based on critical path and depth is provided. The management of intermediate file, the control strategy for LRMs and the fault recovery is also introduced to deal with the problems encountered in the actual parallel implementation process. The parallel scheme is tested with an optimality-based distributed eco-hydrological model (disVOM) in the Poyang Lake sub-basin. It is demonstrated that our approach provide efficient computing performance.
引用
收藏
页码:510 / 532
页数:23
相关论文
共 70 条
[1]   On exploiting task duplication in parallel program scheduling [J].
Ahmad, I ;
Kwok, YK .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1998, 9 (09) :872-892
[2]  
[Anonymous], 2001, - Using ArcGIS Geostatistical Analyst
[3]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theory of NP-Completeness
[4]   Parallel computation for streamflow prediction with distributed hydrologic models [J].
Apostolopoulos, TK ;
Georgakakos, KP .
JOURNAL OF HYDROLOGY, 1997, 197 (1-4) :1-24
[5]   An improved duplication strategy for scheduling precedence constrained graphs in multiprocessor systems [J].
Bansal, S ;
Kumar, P ;
Singh, K .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2003, 14 (06) :533-544
[6]   Hyper-resolution global hydrological modelling: what is next? "Everywhere and locally relevant" [J].
Bierkens, Marc F. P. ;
Bell, Victoria A. ;
Burek, Peter ;
Chaney, Nathaniel ;
Condon, Laura E. ;
David, Cedric H. ;
de Roo, Ad ;
Doell, Petra ;
Drost, Niels ;
Famiglietti, James S. ;
Floerke, Martina ;
Gochis, David J. ;
Houser, Paul ;
Hut, Rolf ;
Keune, Jessica ;
Kollet, Stefan ;
Maxwell, Reed M. ;
Reager, John T. ;
Samaniego, Luis ;
Sudicky, Edward ;
Sutanudjaja, Edwin H. ;
van de Giesen, Nick ;
Winsemius, Hessel ;
Wood, Eric F. .
HYDROLOGICAL PROCESSES, 2015, 29 (02) :310-320
[7]  
Bode B, 2000, USENIX EXTR LIN TECH
[8]  
Borah DK, 2003, T ASAE, V46, P1553, DOI [10.13031/2013.16110, 10.13031/2013.15644]
[9]  
Catlett C., 2008, Adv. Parallel Comput, P225
[10]   Overview of Ecohydrological Models and Systems at the Watershed Scale [J].
Chen, Lajiao ;
Wang, Lizhe ;
Ma, Yan ;
Liu, Peng .
IEEE SYSTEMS JOURNAL, 2015, 9 (03) :1091-1099