Improved Hadoop-based cloud for complex model simulation optimization: Calibration of SWAT as an example

被引:10
作者
Ma, Jinfeng [1 ]
Rao, Kaifeng [2 ]
Li, Ruonan [1 ]
Yang, Yanzheng [1 ]
Li, Weifeng [1 ]
Zheng, Hua [1 ,3 ]
机构
[1] Chinese Acad Sci, State Key Lab Urban & Reg Ecol, Res Ctr Ecoenvironm Sci, Shuangqing Rd 18, Beijing 100085, Peoples R China
[2] Chinese Acad Sci, State Key Joint Lab Environm Simulat & Pollut Con, Res Ctr Ecoenvironm Sci, Beijing 100085, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Hadoop-based cloud; Sequential model; Parallel computing; Partial failure; Simulation optimization; SWAT; MULTIOBJECTIVE EVOLUTIONARY ALGORITHM; NONPOINT-SOURCE POLLUTION; AUTOMATIC CALIBRATION; UNCERTAINTY ANALYSIS; WATER-QUALITY; MANAGEMENT-PRACTICES; HYDROLOGIC-MODELS; MULTIPLE; SOFTWARE; INPUT;
D O I
10.1016/j.envsoft.2022.105330
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A simulation optimization framework requires a substantial number of model simulations, which are computationally intensive and may be impractical when the model simulations are extremely time-consuming. This paper presents an improved Hadoop-based cloud framework to alleviate the computational burden of optimization. The framework parallelizes conventional sequential-model-based optimization techniques by concurrently orchestrating multiple model computations within Hadoop MapReduce. It guarantees the reliability of simulation optimization tasks by handling node failures without affecting the ongoing simulation. A case study, using Bayesian optimization to calibrate a SWAT model, achieved a speedup of nearly 55-58 when using 100 cores, demonstrating the efficiency of parallelizing the Bayesian optimization algorithm on the Hadoop-based cloud. Experiments in which computing nodes were dynamically increased or decreased demonstrated that the framework can automatically rebalance the workload across the remaining nodes. The framework is readily adaptable to other complex model applications that perform sequential-model-based optimizations or large-scale simulations.
引用
收藏
页数:12
相关论文
共 62 条
  • [1] Modelling hydrology and water quality in the pre-alpine/alpine Thur watershed using SWAT
    Abbaspour, Karim C.
    Yang, Jing
    Maximov, Ivan
    Siber, Rosi
    Bogner, Konrad
    Mieleitner, Johanna
    Zobrist, Juerg
    Srinivasan, Raghavan
    [J]. JOURNAL OF HYDROLOGY, 2007, 333 (2-4) : 413 - 430
  • [2] Abbaspour KC, 2004, VADOSE ZONE J, V3, P1340
  • [3] Particle Swarm Optimization for Automatic Calibration of Large Scale Water Quality Model (CE-QUAL-W2): Application to Karkheh Reservoir, Iran
    Afshar, Abbas
    Kazemi, Hamideh
    Saadatpour, Motahareh
    [J]. WATER RESOURCES MANAGEMENT, 2011, 25 (10) : 2613 - 2632
  • [4] Calibration and Validation of Watershed Models and Advances in Uncertainty Analysis in TMDL Studies
    Ahmadisharaf, Ebrahim
    Camacho, Rene A.
    Zhang, Harry X.
    Hantush, Mohamed M.
    Mohamoud, Yusuf M.
    [J]. JOURNAL OF HYDROLOGIC ENGINEERING, 2019, 24 (07)
  • [5] [Anonymous], 2009, Hadoop: The Definitive Guide O'Reilly
  • [6] [Anonymous], 2010, Hadoop in action
  • [7] Large area hydrologic modeling and assessment - Part 1: Model development
    Arnold, JG
    Srinivasan, R
    Muttiah, RS
    Williams, JR
    [J]. JOURNAL OF THE AMERICAN WATER RESOURCES ASSOCIATION, 1998, 34 (01): : 73 - 89
  • [8] Pareto archived dynamically dimensioned search with hypervolume-based selection for multi-objective optimization
    Asadzadeh, Masoud
    Tolson, Bryan
    [J]. ENGINEERING OPTIMIZATION, 2013, 45 (12) : 1489 - 1509
  • [9] Auer P., 2003, Journal of Machine Learning Research, V3, P397, DOI 10.1162/153244303321897663
  • [10] Bacu V., 2017, 13 IEEE INT C INT CO