Economic MPC of Markov Decision Processes: Dissipativity in undiscounted infinite-horizon optimal control

被引:10
作者
Gros, Sebastien [1 ]
Zanon, Mario [2 ]
机构
[1] NTNU, Fac Informat Technol, Dept Eng Cybernet, Trondheim, Norway
[2] IMT Sch Adv Studies Lucca, Piazza San Francesco 19, I-55100 Lucca, Italy
关键词
Markov Decision Processes; Dissipativity for economic MPC; Storage functions; Economic costs; MODEL-PREDICTIVE CONTROL; SYSTEMS;
D O I
10.1016/j.automatica.2022.110602
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Economic Model Predictive Control (MPC) dissipativity theory is central to discussing the stability of policies resulting from minimizing economic stage costs. In its current form, the dissipativity theory for economic MPC applies to problems based on deterministic dynamics or to very specific classes of stochastic problems, and does not readily extend to generic Markov decision processes. In this paper, we clarify the core reason for this difficulty, and propose a generalization of the economic MPC dissipativity theory that circumvents it. This generalization focuses on undiscounted infinite-horizon problems and is based on nonlinear stage cost functionals, allowing one to discuss the Lyapunov asymptotic stability of policies for Markov decision processes in terms of the probability measures underlying their stochastic dynamics. This theory is illustrated for the stochastic linear quadratic regulator with Gaussian process noise, for which a storage functional can be provided explicitly. For the sake of brevity, we limit our discussion to undiscounted Markov decision processes.(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 24 条
  • [1] Economic optimization using model predictive control with a terminal cost
    Amrit, Rishi
    Rawlings, James B.
    Angeli, David
    [J]. ANNUAL REVIEWS IN CONTROL, 2011, 35 (02) : 178 - 186
  • [2] On optimal system operation in robust economic MPC
    Bayer, Florian A.
    Mueller, Matthias A.
    Allgoewer, Frank
    [J]. AUTOMATICA, 2018, 88 : 98 - 106
  • [3] Robust economic Model Predictive Control using stochastic information
    Bayer, Florian A.
    Lorenzen, Matthias
    Mueller, Matthias A.
    Allgoewer, Frank
    [J]. AUTOMATICA, 2016, 74 : 151 - 161
  • [4] Systems with persistent disturbances: predictive control with restricted constraints
    Chisci, L
    Rossiter, JA
    Zappa, G
    [J]. AUTOMATICA, 2001, 37 (07) : 1019 - 1028
  • [5] A Lyapunov Function for Economic Optimizing Model Predictive Control
    Diehl, Moritz
    Amrit, Rishi
    Rawlings, James B.
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2011, 56 (03) : 703 - 707
  • [6] Economic Nonlinear Model Predictive Control
    Faulwasser, Timm
    Gruene, Lars
    Mueller, Matthias A.
    [J]. FOUNDATIONS AND TRENDS IN SYSTEMS AND CONTROL, 2018, 5 (01): : 1 - 98
  • [7] Gros S, 2013, IEEE DECIS CONTR P, P1001, DOI 10.1109/CDC.2013.6760013
  • [8] Economic receding horizon control without terminal constraints
    Gruene, Lars
    [J]. AUTOMATICA, 2013, 49 (03) : 725 - 734
  • [9] Grune L, 2017, COMMUN CONTROL ENG, P1, DOI 10.1007/978-3-319-46024-6
  • [10] Hult Robert, 2018, 2018 European Control Conference (ECC), P602, DOI 10.23919/ECC.2018.8550367