OPTIMAL STOPPING PROBLEMS FOR MULTIARMED BANDIT PROCESSES WITH ARMS INDEPENDENCE

被引:0
作者
YOSHIDA, Y
机构
[1] Faculty of Economics Kitakyushu University Kitagata, 802, Kokuraminami-ku Kitakyushu
关键词
MULTIARMED BANDIT PROBLEM; MULTIPARAMETER OPTIMAL STOPPING PROBLEM; MULTIPARAMETER PROCESSES; DYNAMIC ALLOCATION INDEX; LINEAR PROGRAMMING;
D O I
10.1016/0898-1221(93)90058-4
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
This paper deals with the optimal stopping problem for multiarmed bandit processes. Under the assumption of independence of arms we show that optimal strategies and stopping times are expressed by the dynamic allocation indices for each arm. This paper reduces this problem to several independent one-parameter optimal stopping problems. On the basis of these results, we characterize optimal strategies and stopping times. Moreover, this paper also extends those to the case allowing time constraints. In the case where arm's state evolve according to Markov chains with finite state, linear programming calculation of optimal strategies and stopping times is discussed.
引用
收藏
页码:47 / 60
页数:14
相关论文
共 11 条
[1]   LINEAR-PROGRAMMING FOR FINITE STATE MULTIARMED BANDIT PROBLEMS [J].
CHEN, YR ;
KATEHAKIS, MN .
MATHEMATICS OF OPERATIONS RESEARCH, 1986, 11 (01) :180-183
[2]  
Derman C, 1970, FINITE STATE MARKOVI
[3]  
GITTINS JC, 1979, J ROY STAT SOC B MET, V41, P148
[4]  
GITTINS JC, 1972, PROGR STATISTICS EUR, P241
[5]   STOPPABLE FAMILIES OF ALTERNATIVE BANDIT PROCESSES [J].
GLAZEBROOK, KD .
JOURNAL OF APPLIED PROBABILITY, 1979, 16 (04) :843-854
[6]   OPTIMAL STOPPING AND SUPERMARTINGALES OVER PARTIALLY ORDERED SETS [J].
MANDELBAUM, A ;
VANDERBEI, RJ .
ZEITSCHRIFT FUR WAHRSCHEINLICHKEITSTHEORIE UND VERWANDTE GEBIETE, 1981, 57 (02) :253-264
[7]   DISCRETE MULTIARMED BANDITS AND MULTIPARAMETER PROCESSES [J].
MANDELBAUM, A .
PROBABILITY THEORY AND RELATED FIELDS, 1986, 71 (01) :129-147
[8]  
Neveu J., 1975, DISCRETE PARAMETER M
[9]  
SHIRYAEV AN, 1979, OPTIMAL STOPPING RUL
[10]   EXTENSIONS OF THE MULTIARMED BANDIT PROBLEM - THE DISCOUNTED CASE [J].
VARAIYA, PP ;
WALRAND, JC ;
BUYUKKOC, C .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1985, 30 (05) :426-439