Exact Dynamic Programming for Positive Systems With Linear Optimal Cost

被引:1
作者
Li, Yuchao [1 ,2 ]
Rantzer, Anders [3 ,4 ]
机构
[1] KTH Royal Inst Technol, Div Decis & Control Syst, S-11428 Stockholm, Sweden
[2] Arizona State Univ, Sch Comp & Augmented Intelligence, Tempe, AZ 85281 USA
[3] Lund Univ, Dept Automat Control, SE-22100 Lund, Sweden
[4] KTH Royal Inst Technol, Excellence Ctr ELLIIT & Wallenberg AI, Autonomous Syst & Software Program, S-11428 Stockholm, Sweden
基金
欧洲研究理事会;
关键词
Costs; Vectors; Optimal control; Linear systems; Nonlinear equations; Cost function; Dynamic programming; optimal control; positive linear system; stability of linear systems; STOCHASTIC-CONTROL;
D O I
10.1109/TAC.2024.3420716
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent work (Rantzer, 2022) formulated a class of optimal control problems involving positive linear systems, linear stage costs, and elementwise constraints on control. It was shown that the problem admits linear optimal cost and the associated Bellman's equation can be characterized by a finite-dimensional nonlinear equation, which is solved by linear programming. In this work, we report exact dynamic programming (DP) theories for the same class of problems. Moreover, we extend the results to a related class of problems where the norms of control are bounded while the optimal costs remain linear. In both cases, we provide conditions under which the solutions are unique, investigate properties of the optimal policies, study the convergence of value iteration, policy iteration, and optimistic policy iteration applied to such problems, and analyze the boundedness of the solution to the associated optimization programs. Apart from a form of the Frobenius-Perron theorem, the majority of our results are built upon generic DP theory applicable to problems involving nonnegative stage costs.
引用
收藏
页码:8738 / 8750
页数:13
相关论文
共 28 条
  • [1] [Anonymous], 1997, Convex Analysis
  • [2] [Anonymous], 1979, Introduction to Dynamic Systems: Theory, Models, and Applications
  • [3] Stochastic Control With Affine Dynamics and Extended Quadratic Costs
    Barratt, Shane
    Boyd, Stephen
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (01) : 320 - 335
  • [4] DYNAMIC PROGRAMMING
    BELLMAN, R
    [J]. SCIENCE, 1966, 153 (3731) : 34 - &
  • [5] THE THEORY OF DYNAMIC PROGRAMMING
    BELLMAN, R
    [J]. BULLETIN OF THE AMERICAN MATHEMATICAL SOCIETY, 1954, 60 (06) : 503 - 515
  • [6] Bertsekas D, 2009, Convex optimization theory, V1
  • [7] Bertsekas D., 2022, Lessons from AlphaZero for Optimal, Model Predictive, and Adaptive Control
  • [8] Bertsekas D. P., 1975, Proceedings of the 1975 IEEE Conference on Decision Control including the 14th Symposium on Adaptive Processes, P20
  • [9] Bertsekas D. P., 1987, DYNAMIC PROGRAMMING
  • [10] Bertsekas D.P., 2017, Dynamic Programming and Optimal Control, V1