Average cost;
Markov decision processes;
Dynamic programming;
Discrete time systems;
MARKOV DECISION-PROCESSES;
STABILIZATION;
FORMULATIONS;
THEOREM;
GROWTH;
D O I:
10.1016/j.jmaa.2022.126906
中图分类号:
O29 [应用数学];
学科分类号:
070104 ;
摘要:
This paper concerns optimal control problems for infinite-horizon discrete-time deterministic systems with the long-run average cost (AC) criterion. This optimality criterion can be traced back to a paper by Bellman [6] for a class of Markov decision processes (MDPs). We present a survey of some of the main approaches to study the AC problem, namely, the AC optimality (or dynamic programming) equation, the steady state approach, and the vanishing discount approach, emphasizing the difference between the deterministic control problem and the corresponding (stochastic) MDP. Several examples illustrate these approaches and related results. We also state some open problems. (c) 2022 Elsevier Inc. All rights reserved.