Optimality inequalities for average cost Markov decision processes and the stochastic cash balance problem

被引:32
作者
Feinberg, Eugene A. [1 ]
Lewis, Mark E.
机构
[1] SUNY Stony Brook, Dept Appl Math & Stat, Stony Brook, NY 11794 USA
[2] Cornell Univ, Sch Operat Res & Ind Engn, Ithaca, NY 14853 USA
关键词
Markov decision process; average cost per unit time; optimality inequality; optimal policy; inventory control; INVENTORY CONTROL; POLICIES; RETURNS; MANAGEMENT; EXISTENCE; DISPOSAL;
D O I
10.1287/moor.1070.0269
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
For general state and action space Markov decision processes, we present sufficient conditions for the existence of solutions of the average cost optimality inequalities. These conditions also imply the convergence of both the optimal discounted cost value function and policies to the corresponding objects for the average costs per unit time case. Inventory models are natural applications of our results. We describe structural properties of average cost optimal policies for the cash balance problem; an inventory control problem where the demand may be negative and the decision-maker can produce or scrap inventory. We also show the convergence of optimal thresholds in the finite horizon case to those under the expected discounted cost criterion and those under the expected discounted costs to those under the average costs per unit time criterion.
引用
收藏
页码:769 / 783
页数:15
相关论文
共 31 条
[1]  
[Anonymous], STOCHASTIC DYNAMIC P
[2]  
[Anonymous], 2002, HDB MARKOV DECISION
[3]  
[Anonymous], 2000, DYNAMIC PROGRAMMING
[4]  
Bertsekas D.P., 2001, DYNAMIC PROGRAMMING, V2
[5]  
Bertsekas D. P., 1996, Stochastic Optimal Control: the Discrete-Time Case, V5
[6]   COMPARING RECENT ASSUMPTIONS FOR THE EXISTENCE OF AVERAGE OPTIMAL STATIONARY POLICIES [J].
CAVAZOSCADENA, R ;
SENNOTT, LI .
OPERATIONS RESEARCH LETTERS, 1992, 11 (01) :33-37
[7]   A COUNTEREXAMPLE ON THE OPTIMALITY EQUATION IN MARKOV DECISION CHAINS WITH THE AVERAGE COST CRITERION [J].
CAVAZOSCADENA, R .
SYSTEMS & CONTROL LETTERS, 1991, 16 (05) :387-392
[8]   Coordinating inventory control and pricing strategies with random demand and fixed ordering cost: The infinite horizon case [J].
Chen, X ;
Simchi-Levi, D .
MATHEMATICS OF OPERATIONS RESEARCH, 2004, 29 (03) :698-723
[9]  
CHEN X, 2003, NEW APPROACH STOCHAS
[10]   EXISTENCE OF OPTIMAL SIMPLE POLICIES FOR DISCOUNTED-COST INVENTORY AND CASH MANAGEMENT IN CONTINUOUS TIME [J].
CONSTANTINIDES, GM ;
RICHARD, SF .
OPERATIONS RESEARCH, 1978, 26 (04) :620-636