Dynamic allocation indices for restless projects and queueing admission control:: a polyhedral approach

被引:60
作者
Niño-Mora, J [1 ]
机构
[1] Univ Pompeu Fabra, Dept Econ & Business, Barcelona 08005, Spain
关键词
Markov decision process; restless bandits; polyhedral combinatorics; extended polymatroid; adaptive-greedy algorithm; dynamic allocation index; stochastic scheduling; threshold policy; index policy; Gittins index; Klimov index; Whittle index; control of queues; admission control; routing; make-to-stock; multiclass queue; finite buffers; conservation laws; achievable performance;
D O I
10.1007/s10107-002-0362-6
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper develops a polyhedral approach to the design, analysis, and computation of dynamic allocation indices for scheduling binary-action (engage/rest) Markovian stochastic projects which can change state when rested (restless bandits (RBs)), based on partial conservation laws (PCLs). This extends previous work by the author [J. Nino-Mora (2001): Restless bandits, partial conservation laws and indexability. Adv. Appl. Probab. 33, 76-98], where PCLs were shown to imply the optimality of index policies with a postulated structure in stochastic scheduling problems, under admissible linear objectives, and they were deployed to obtain simple sufficient conditions for the existence of Whittle's (1988) RB index (indexability), along with an adaptive-greedy index algorithm. The new contributions include: (i) we develop the polyhedral foundation of the PCL framework, based on the structural and algorithmic properties of a new polytope associated with an accessible set system (J, F) (T-extended polymatroid); (ii) we present new dynamic allocation indices for RBs, motivated by an admission control model, which extend Whittle's and have a significantly increased scope; (iii) we deploy PCLs to obtain both sufficient conditions for the existence of the new indices (PCL-indexability), and a new adaptive-greedy index algorithm; (iv) we interpret PCL-indexability as a form of the classic economics law of diminishing marginal returns, and characterize the index as an optimal marginal cost rate; we further solve a related optimal constrained control problem; (v) we carry out a PCL-indexability analysis of the motivating admission control model, under time-discounted and long-run average criteria; this gives, under mild conditions, a new index characterization of optimal threshold policies; and (vi) we apply the latter to present new heuristic index policies for two hard queueing control problems: admission control and routing to parallel queues; and scheduling a multiclass make-to-stock queue with lost sales, both under state-dependent holding cost rates and birth-death dynamics.
引用
收藏
页码:361 / 413
页数:53
相关论文
共 37 条