Curses, Tradeoffs, and Scalable Management: Advancing Evolutionary Multiobjective Direct Policy Search to Improve Water Reservoir Operations

被引:184
作者
Giuliani, Matteo [1 ]
Castelletti, Andrea [1 ,2 ]
Pianosi, Francesca [3 ]
Mason, Emanuele [1 ]
Reed, Patrick M. [4 ]
机构
[1] Politecn Milan, Dept Elect Informat & Bioengn, Pza Leonardo da Vinci 32, I-20133 Milan, Italy
[2] ETH, Inst Environm Engn, Ramistr 101, CH-8092 Zurich, Switzerland
[3] Univ Bristol, Dept Civil Engn, Queens Bldg,Univ Walk, Bristol BS8 1TR, Avon, England
[4] Cornell Univ, Sch Civil & Environm Engn, Ithaca, NY 14853 USA
基金
英国自然环境研究理事会;
关键词
Water management; Direct policy search; Multiobjective evolutionary algorithm; UNIVERSAL APPROXIMATION; HYDROLOGIC INFORMATION; PARAMETRIC RULE; NEURAL-NETWORKS; OPTIMIZATION; RESOURCES; SYSTEM; IDENTIFICATION; PERFORMANCE; ALGORITHMS;
D O I
10.1061/(ASCE)WR.1943-5452.0000570
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Optimal management policies for water reservoir operation are generally designed via stochastic dynamic programming (SDP). Yet, the adoption of SDP in complex real-world problems is challenged by the three curses of dimensionality, modeling, and multiple objectives. These three curses considerably limit SDP's practical application. Alternatively, this study focuses on the use of evolutionary multiobjective direct policy search (EMODPS), a simulation-based optimization approach that combines direct policy search, nonlinear approximating networks, and multiobjective evolutionary algorithms to design Pareto-approximate closed-loop operating policies for multipurpose water reservoirs. This analysis explores the technical and practical implications of using EMODPS through a careful diagnostic assessment of the effectiveness and reliability of the overall EMODPS solution design as well as of the resulting Pareto-approximate operating policies. The EMODPS approach is evaluated using the multipurpose Hoa Binh water reservoir in Vietnam, where water operators are seeking to balance the conflicting objectives of maximizing hydropower production and minimizing flood risks. A key choice in the EMODPS approach is the selection of alternative formulations for flexibly representing reservoir operating policies. This study distinguishes between the relative performance of two widely-used nonlinear approximating networks, namely artificial neural networks (ANNs) and radial basis functions (RBFs). The results show that RBF solutions are more effective than ANN ones in designing Pareto approximate policies for the Hoa Binh reservoir. Given the approximate nature of EMODPS, the diagnostic benchmarking uses SDP to evaluate the overall quality of the attained Pareto-approximate results. Although the Hoa Binh test case's relative simplicity should maximize the potential value of SDP, the results demonstrate that EMODPS successfully dominates the solutions derived via SDP. (C) 2015 American Society of Civil Engineers.
引用
收藏
页数:17
相关论文
共 95 条
[1]  
[Anonymous], 2005, WATER RESOURCES SYST
[2]  
[Anonymous], 2003, C MO AP C M, DOI 10.1017/CBO9780511543241
[3]  
[Anonymous], 2007, EVOLUTIONARY ALGORIT
[4]  
[Anonymous], 2007, Approximate Dynamic Programming: Solving the Curses of Dimensionality (Wiley Series in Probability and Statistics)
[5]   Should we build more large dams? The actual costs of hydropower megaproject development [J].
Ansar, Atif ;
Flyvbjerg, Bent ;
Budzier, Alexander ;
Lunn, Daniel .
ENERGY POLICY, 2014, 69 :43-56
[6]   Experiments with infinite-horizon, policy-gradient estimation [J].
Baxter, J ;
Bartlett, PL ;
Weaver, L .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2001, 15 :351-381
[7]  
Bellman R. E., 1957, Dynamic programming. Princeton landmarks in mathematics
[8]  
Bertsekas D. P., 1996, NEURODYNAMIC PROGRAM
[9]  
Bertsekas D. P., 1976, DYNAMIC PROGRAMMING
[10]   Dynamic programming and suboptimal control: A survey from ADP to MPC [J].
Bertsekas, DP .
EUROPEAN JOURNAL OF CONTROL, 2005, 11 (4-5) :310-334