Maximum Entropy Optimal Control of Continuous-Time Dynamical Systems

被引:6
|
作者
Kim, Jeongho [1 ,2 ]
Yang, Insoon [3 ,4 ]
机构
[1] Seoul Natl Univ, Seoul 08826, South Korea
[2] Korea Inst Adv Study, Seoul 02455, South Korea
[3] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul 08826, South Korea
[4] Seoul Natl Univ, Automat & Syst Res Inst, Seoul 08826, South Korea
基金
新加坡国家研究基金会;
关键词
Dynamic programming (DP); entropy; Hamilton-Jacobi-Bellman (HJB) equations; optimal control; viscosity solution; VISCOSITY SOLUTIONS; RELAXED CONTROLS; EQUATIONS; DIMENSIONALITY; ALGORITHM; CURSE;
D O I
10.1109/TAC.2022.3168168
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Maximum entropy reinforcement learning methods have been successfully applied to a range of challenging sequential decision-making and control tasks. However, most of the existing techniques are designed for discrete-time systems although there has been a growing interest to handle physical processes evolving in continuous time. As a first step toward their extension to continuous-time systems, this article aims to study the theory of maximum entropy optimal control in continuous time. Applying the dynamic programming principle, we derive a novel class of Hamilton-Jacobi-Bellman (HJB) equations and prove that the optimal value function of the maximum entropy control problem corresponds to the unique viscosity solution of the HJB equation. We further show that the optimal control is uniquely characterized as Gaussian in the case of control-affine systems and that, for linear-quadratic problems, the HJB equation is reduced to a Riccati equation, which can be used to obtain an explicit expression of the optimal control. The results of our numerical experiments demonstrate the performance of our maximum entropy method in continuous-time optimal control and reinforcement learning problems.
引用
收藏
页码:2018 / 2033
页数:16
相关论文
共 50 条
  • [31] Incremental stability and contraction via impulsive control for continuous-time dynamical systems
    Liu, Bin
    Xu, Bo
    Sun, Zhijie
    NONLINEAR ANALYSIS-HYBRID SYSTEMS, 2021, 39 (39)
  • [32] Optimal Mismatched Disturbance Rejection Control for Continuous-Time Uncontrollable Systems
    Lv, Shichao
    Li, Hongdan
    Liu, Dongqing
    Li, Shihua
    Zhang, Huanshui
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024,
  • [33] Value Iteration and Adaptive Optimal Control for Linear Continuous-time Systems
    Bian, Tao
    Jiang, Zhong-Ping
    PROCEEDINGS OF THE 2015 7TH IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (CIS) AND ROBOTICS, AUTOMATION AND MECHATRONICS (RAM), 2015, : 53 - 58
  • [34] Comments on "On-line optimal tracking control of continuous-time systems"
    Atam, Ercan
    MECHATRONICS, 2009, 19 (08) : 1236 - 1239
  • [35] Optimal Control for Continuous-Time Scalar Nonlinear Systems With Known Dynamics
    Tymoshchuk, Pavlo
    16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020), 2020, : 695 - 700
  • [36] Approximate Optimal Tracking Control for Continuous-Time Unknown Nonlinear Systems
    Na, Jing
    Lv, Yongfeng
    Wu, Xing
    Guo, Yu
    Chen, Qiang
    2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 8984 - 8989
  • [37] Restricted-structure LOG optimal control for continuous-time systems
    Grimble, MJ
    IEE PROCEEDINGS-CONTROL THEORY AND APPLICATIONS, 2000, 147 (02): : 185 - 195
  • [38] Distributed optimal control for continuous-time nonaffine nonlinear interconnected systems
    Farzanegan, Behzad
    Suratgar, Amir Abolfazl
    Menhaj, Mohammad Bagher
    Zamani, Mohsen
    INTERNATIONAL JOURNAL OF CONTROL, 2022, 95 (12) : 3462 - 3476
  • [39] An iterative algorithm for the optimal control of continuous-time switched linear systems
    Bemporad, A
    Giua, A
    Seatzu, C
    WODES'02: SIXTH INTERNATIONAL WORKSHOP ON DISCRETE EVENT SYSTEMS, PROCEEDINGS, 2002, : 335 - 340
  • [40] STATIONARY OPTIMAL-CONTROL OF STOCHASTICALLY SAMPLED CONTINUOUS-TIME SYSTEMS
    DEKONING, WL
    AUTOMATICA, 1988, 24 (01) : 77 - 79