Optimality inequalities for average cost Markov decision processes and the stochastic cash balance problem

被引:32
作者
Feinberg, Eugene A. [1 ]
Lewis, Mark E.
机构
[1] SUNY Stony Brook, Dept Appl Math & Stat, Stony Brook, NY 11794 USA
[2] Cornell Univ, Sch Operat Res & Ind Engn, Ithaca, NY 14853 USA
关键词
Markov decision process; average cost per unit time; optimality inequality; optimal policy; inventory control; INVENTORY CONTROL; POLICIES; RETURNS; MANAGEMENT; EXISTENCE; DISPOSAL;
D O I
10.1287/moor.1070.0269
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
For general state and action space Markov decision processes, we present sufficient conditions for the existence of solutions of the average cost optimality inequalities. These conditions also imply the convergence of both the optimal discounted cost value function and policies to the corresponding objects for the average costs per unit time case. Inventory models are natural applications of our results. We describe structural properties of average cost optimal policies for the cash balance problem; an inventory control problem where the demand may be negative and the decision-maker can produce or scrap inventory. We also show the convergence of optimal thresholds in the finite horizon case to those under the expected discounted cost criterion and those under the expected discounted costs to those under the average costs per unit time criterion.
引用
收藏
页码:769 / 783
页数:15
相关论文
共 31 条
[11]   CASH BALANCE PROBLEM [J].
ELTON, EJ ;
GRUBER, MJ .
OPERATIONAL RESEARCH QUARTERLY, 1974, 25 (04) :553-572
[12]  
Eppen G.D., 1969, INT ECON REV, V10, P119, DOI DOI 10.2307/2525547
[13]   Optimality of four-threshold policies in inventory systems with customer returns and borrowing/storage options [J].
Feinberg, EA ;
Lewis, ME .
PROBABILITY IN THE ENGINEERING AND INFORMATIONAL SCIENCES, 2005, 19 (01) :45-71
[14]  
FEINBERG EA, 2006, TR1442 CORN U
[15]  
FERNANDEZGAUCHERAND E, 1992, PROCEEDINGS OF THE 31ST IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, P2179, DOI 10.1109/CDC.1992.371410
[16]   Controlling inventories with stochastic item returns: A basic model [J].
Fleischmann, M ;
Kuik, R ;
Dekker, R .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2002, 138 (01) :63-75
[17]  
GASKILL HS, 1998, ELEMENTS REAL ANAL
[18]   OPTIMAL CASH BALANCE LEVELS [J].
GIRGIS, NM .
MANAGEMENT SCIENCE, 1968, 15 (03) :130-140
[19]  
HEMANDEZLERMA O, 1996, DISCRETE TIME MARKOV
[20]  
HEMANDEZLERMA O, 1991, SYSTEMS CONTROL LETT, V17, P237