Improved Hadoop-based cloud for complex model simulation optimization: Calibration of SWAT as an example

被引:12
作者
Ma, Jinfeng [1 ]
Rao, Kaifeng [2 ]
Li, Ruonan [1 ]
Yang, Yanzheng [1 ]
Li, Weifeng [1 ]
Zheng, Hua [1 ,3 ]
机构
[1] Chinese Acad Sci, State Key Lab Urban & Reg Ecol, Res Ctr Ecoenvironm Sci, Shuangqing Rd 18, Beijing 100085, Peoples R China
[2] Chinese Acad Sci, State Key Joint Lab Environm Simulat & Pollut Con, Res Ctr Ecoenvironm Sci, Beijing 100085, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Hadoop-based cloud; Sequential model; Parallel computing; Partial failure; Simulation optimization; SWAT; MULTIOBJECTIVE EVOLUTIONARY ALGORITHM; NONPOINT-SOURCE POLLUTION; AUTOMATIC CALIBRATION; UNCERTAINTY ANALYSIS; WATER-QUALITY; MANAGEMENT-PRACTICES; HYDROLOGIC-MODELS; MULTIPLE; SOFTWARE; INPUT;
D O I
10.1016/j.envsoft.2022.105330
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A simulation optimization framework requires a substantial number of model simulations, which are computationally intensive and may be impractical when the model simulations are extremely time-consuming. This paper presents an improved Hadoop-based cloud framework to alleviate the computational burden of optimization. The framework parallelizes conventional sequential-model-based optimization techniques by concurrently orchestrating multiple model computations within Hadoop MapReduce. It guarantees the reliability of simulation optimization tasks by handling node failures without affecting the ongoing simulation. A case study, using Bayesian optimization to calibrate a SWAT model, achieved a speedup of nearly 55-58 when using 100 cores, demonstrating the efficiency of parallelizing the Bayesian optimization algorithm on the Hadoop-based cloud. Experiments in which computing nodes were dynamically increased or decreased demonstrated that the framework can automatically rebalance the workload across the remaining nodes. The framework is readily adaptable to other complex model applications that perform sequential-model-based optimizations or large-scale simulations.
引用
收藏
页数:12
相关论文
共 62 条
[1]   Modelling hydrology and water quality in the pre-alpine/alpine Thur watershed using SWAT [J].
Abbaspour, Karim C. ;
Yang, Jing ;
Maximov, Ivan ;
Siber, Rosi ;
Bogner, Konrad ;
Mieleitner, Johanna ;
Zobrist, Juerg ;
Srinivasan, Raghavan .
JOURNAL OF HYDROLOGY, 2007, 333 (2-4) :413-430
[2]  
Abbaspour KC, 2004, VADOSE ZONE J, V3, P1340
[3]   Particle Swarm Optimization for Automatic Calibration of Large Scale Water Quality Model (CE-QUAL-W2): Application to Karkheh Reservoir, Iran [J].
Afshar, Abbas ;
Kazemi, Hamideh ;
Saadatpour, Motahareh .
WATER RESOURCES MANAGEMENT, 2011, 25 (10) :2613-2632
[4]   Calibration and Validation of Watershed Models and Advances in Uncertainty Analysis in TMDL Studies [J].
Ahmadisharaf, Ebrahim ;
Camacho, Rene A. ;
Zhang, Harry X. ;
Hantush, Mohamed M. ;
Mohamoud, Yusuf M. .
JOURNAL OF HYDROLOGIC ENGINEERING, 2019, 24 (07)
[5]  
[Anonymous], 2009, Hadoop: The Definitive Guide: The Definitive Guide
[6]  
[Anonymous], 2010, Hadoop in action
[7]   Large area hydrologic modeling and assessment - Part 1: Model development [J].
Arnold, JG ;
Srinivasan, R ;
Muttiah, RS ;
Williams, JR .
JOURNAL OF THE AMERICAN WATER RESOURCES ASSOCIATION, 1998, 34 (01) :73-89
[8]   Pareto archived dynamically dimensioned search with hypervolume-based selection for multi-objective optimization [J].
Asadzadeh, Masoud ;
Tolson, Bryan .
ENGINEERING OPTIMIZATION, 2013, 45 (12) :1489-1509
[9]  
Auer P., 2003, Journal of Machine Learning Research, V3, P397, DOI 10.1162/153244303321897663
[10]  
Bacu V., 2017, 13 IEEE INT C INT CO