Technical note-Knowledge gradient for selection with covariates: Consistency and computation

被引:15
作者
Ding, Liang [1 ]
Hong, L. Jeff [2 ,3 ]
Shen, Haihui [4 ]
Zhang, Xiaowei [5 ]
机构
[1] Texas A&M Univ, Dept Ind Syst Engn, College Stn, TX USA
[2] Fudan Univ, Sch Management, Shanghai, Peoples R China
[3] Fudan Univ, Sch Data Sci, Shanghai, Peoples R China
[4] Shanghai Jiao Tong Univ, Antai Coll Econ & Management, Sino US Global Logist Inst, Shanghai, Peoples R China
[5] Univ Hong Kong, Fac Business & Econ, Pok Fu Lam, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
consistency; covariates; knowledge gradient; selection of the best; MULTIARMED BANDIT PROBLEM; BAYESIAN OPTIMIZATION; COST-EFFECTIVENESS; SIMULATION; CHEMOPREVENTION; ASPIRIN;
D O I
10.1002/nav.22028
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Knowledge gradient is a design principle for developing Bayesian sequential sampling policies to solve optimization problems. In this paper, we consider the ranking and selection problem in the presence of covariates, where the best alternative is not universal but depends on the covariates. In this context, we prove that under minimal assumptions, the sampling policy based on knowledge gradient is consistent, in the sense that following the policy the best alternative as a function of the covariates will be identified almost surely as the number of samples grows. We also propose a stochastic gradient ascent algorithm for computing the sampling policy and demonstrate its performance via numerical experiments.
引用
收藏
页码:496 / 507
页数:12
相关论文
共 34 条
[1]   STOCHASTIC KRIGING FOR SIMULATION METAMODELING [J].
Ankenman, Bruce ;
Nelson, Barry L. ;
Staum, Jeremy .
2008 WINTER SIMULATION CONFERENCE, VOLS 1-5, 2008, :362-370
[2]  
[Anonymous], 2007, Springer Monographs in Mathematics
[3]  
[Anonymous], 2017, ADV NEURAL INFORM PR
[4]   Putting one-to-one marketing to work: Personalization, customization, and choice [J].
Arora, Neeraj ;
Dreze, Xavier ;
Ghose, Anindya ;
Hess, James D. ;
Iyengar, Raghuram ;
Jing, Bing ;
Joshi, Yogesh ;
Kumar, V. ;
Lurie, Nicholas ;
Neslin, Scott ;
Sajeesh, S. ;
Su, Meng ;
Syam, Niladri ;
Thomas, Jacquelyn ;
Zhang, Z. John .
MARKETING LETTERS, 2008, 19 (3-4) :305-321
[5]   A supermartingale approach to Gaussian process based sequential design of experiments [J].
Bect, Julien ;
Bachoc, Francois ;
Ginsbourger, David .
BERNOULLI, 2019, 25 (4A) :2883-2919
[6]   Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems [J].
Bubeck, Sebastien ;
Cesa-Bianchi, Nicolo .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2012, 5 (01) :1-122
[7]  
Chen CH, 2015, INT SER OPER RES MAN, V216, P45, DOI 10.1007/978-1-4939-1384-8_3
[8]   Statins and Aspirin for Chemoprevention in Barrett's Esophagus: Results of a Cost-Effectiveness Analysis [J].
Choi, Sung Eun ;
Perzan, Katherine E. ;
Tramontano, Angela C. ;
Kong, Chung Yin ;
Hur, Chin .
CANCER PREVENTION RESEARCH, 2014, 7 (03) :341-350
[9]   The Knowledge-Gradient Policy for Correlated Normal Beliefs [J].
Frazier, Peter ;
Powell, Warren ;
Dayanik, Savas .
INFORMS JOURNAL ON COMPUTING, 2009, 21 (04) :599-613
[10]   A KNOWLEDGE-GRADIENT POLICY FOR SEQUENTIAL INFORMATION COLLECTION [J].
Frazier, Peter I. ;
Powell, Warren B. ;
Dayanik, Savas .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2008, 47 (05) :2410-2439