restless bandits;
stochastic scheduling;
index policies;
indexability;
control by price;
semi-Markov decision processes;
dynamic resource allocation;
diminishing returns;
marginal productivity;
efficient frontier;
convex optimization;
bias;
mixed criteria;
make to order;
make to stock;
control of queues;
production-inventory control;
partial conservation laws;
achievable performance;
DOI:
10.1287/moor.1050.0165
Chinese Library Classification (CLC):
C93 [Management];
O22 [Operations Research];
Discipline classification codes:
070105 ;
12 ;
1201 ;
1202 ;
120202 ;
Abstract:
This paper presents a framework, grounded in convex optimization and ideas from economics, for solving by index policies problems of optimal dynamic allocation of effort to a discrete-state (finite or countable) binary-action (work/rest) semi-Markov restless bandit project, elucidating issues raised by previous work. Its contributions include: (i) the concept of a restless bandit's marginal productivity index (MPI), characterizing optimal policies relative to general cost and work measures; (ii) the characterization of indexable restless bandits as those satisfying diminishing marginal returns to work, consistently with a nested family of threshold policies; (iii) sufficient indexability conditions via partial conservation laws (PCLs); (iv) the characterization of the MPI as an optimal marginal productivity rate relative to feasible active-state sets; (v) application to semi-Markov bandits under several criteria, including a new mixed average-bias criterion; and (vi) PCL-indexability analyses and MPIs for optimal service control of make-to-order/make-to-stock queues with convex holding costs, under discounted and average-bias criteria.
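As a rough illustration of how an MPI is used in practice (a minimal sketch, not from the paper): once an index value has been computed for each state of each project, the resulting priority-index policy simply works, at each decision epoch, on the projects whose current states carry the highest indices. The index table `mpi` and the function name `index_policy` below are hypothetical example data for illustration only.

```python
# Hypothetical sketch of a priority-index policy for restless bandits:
# given each project's current state and a precomputed table of
# marginal productivity indices (MPIs), activate the m projects
# whose current states have the largest index values.

def index_policy(states, mpi, m):
    """Return the set of project ids to activate: the m projects whose
    current state carries the highest MPI (ties broken by project id)."""
    ranked = sorted(range(len(states)),
                    key=lambda k: mpi[k][states[k]],
                    reverse=True)
    return set(ranked[:m])

# Example: 3 two-state projects with made-up MPI tables (state -> index).
mpi = [
    {0: 0.2, 1: 0.9},   # project 0
    {0: 0.5, 1: 0.1},   # project 1
    {0: 0.4, 1: 0.8},   # project 2
]
states = [1, 0, 1]       # current state of each project
print(index_policy(states, mpi, m=2))  # activates projects 0 and 2
```

The indices here are placeholders; in the paper's framework they would be computed from the project's cost and work measures, and indexability (diminishing marginal returns to work) guarantees the policy is consistent with a nested family of threshold policies.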