The Knowledge-Gradient Policy for Correlated Normal Beliefs

被引:267
作者
Frazier, Peter [1 ]
Powell, Warren [1 ]
Dayanik, Savas [1 ]
机构
[1] Princeton Univ, Dept Operat Res & Financial Engn, Princeton, NJ 08544 USA
关键词
simulation: design of experiments; decision analysis: sequential; statistics; Bayesian; ORDINAL OPTIMIZATION; GLOBAL OPTIMIZATION; SIMULATION ALLOCATION; COMPUTER EXPERIMENTS; SELECTION; SYSTEMS; DESIGN; MODELS;
D O I
10.1287/ijoc.1080.0314
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We consider a Bayesian ranking and selection problem with independent normal rewards and a correlated multivariate normal belief on the mean values of these rewards. Because this formulation of the ranking and selection problem models dependence between alternatives' mean values, algorithms may use this dependence to perform efficiently even when the number of alternatives is very large. We propose a fully sequential sampling policy called the knowledge-gradient policy, which is provably optimal in some special cases and has bounded suboptimality in all others. We then demonstrate how this policy may be applied to efficiently maximize a continuous function on a continuous domain while constrained to a fixed number of noisy measurements.
引用
收藏
页码:599 / 613
页数:15
相关论文
共 41 条
[1]  
[Anonymous], 1978, Towards Global Optimization
[2]  
[Anonymous], 1968, APPL STAT DECISION T
[3]  
[Anonymous], STAT SPATIAL DATA
[4]  
Bechhofer R., 1995, DESIGN ANAL EXPT STA
[6]   Selecting a selection procedure [J].
Branke, Juergen ;
Chick, Stephen E. ;
Schmidt, Christian .
MANAGEMENT SCIENCE, 2007, 53 (12) :1916-1932
[7]   Simulation budget allocation for further enhancing the efficiency of ordinal optimization [J].
Chen, CH ;
Lin, JW ;
Yücesan, E ;
Chick, SE .
DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2000, 10 (03) :251-270
[8]  
CHEN CH, 1996, WINT SIM C SAN DIEG, P398
[9]   Efficient dynamic simulation allocation in ordinal optimization [J].
Chen, Chun-Hung ;
He, Donghai ;
Fu, Michael .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2006, 51 (12) :2005-2009
[10]   New two-stage and sequential procedures for selecting the best simulated system [J].
Chick, SE ;
Inoue, K .
OPERATIONS RESEARCH, 2001, 49 (05) :732-743