Maximum Entropy Optimal Density Control of Discrete-Time Linear Systems and Schrodinger Bridges

被引:3
作者
Ito, Kaito [1 ]
Kashima, Kenji [2 ]
机构
[1] Tokyo Inst Technol, Sch Comp, Yokohama 2268502, Japan
[2] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan
关键词
Maximum entropy (MaxEnt); optimal control; Schrodinger bridge; stochastic control; COVARIANCE CONTROL; NONLINEAR-SYSTEMS; OPTIMAL TRANSPORT;
D O I
10.1109/TAC.2023.3305319
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider an entropy-regularized version of optimal density control of deterministic discrete-time linear systems. Entropy regularization, or a maximum entropy (MaxEnt) method for optimal control, has attracted much attention especially in reinforcement learning due to its many advantages, such as a natural exploration strategy. Despite the merits, high-entropy control policies induced by the regularization introduce probabilistic uncertainty into systems, which severely limits the applicability of MaxEnt optimal control to safety-critical systems. To remedy this situation, we impose a Gaussian density constraint at a specified time on the MaxEnt optimal control to directly control state uncertainty. Specifically, we derive the explicit form of the MaxEnt optimal density control. In addition, we also consider the case where density constraints are replaced by fixed-point constraints. Then, we characterize the associated state process as a pinned process, which is a generalization of the Brownian bridge to linear systems. Finally, we reveal that the MaxEnt optimal density control gives the so-called Schrodinger bridge associated with a discrete-time linear system.
引用
收藏
页码:1536 / 1551
页数:16
相关论文
共 41 条
[1]  
Bakolas E, 2020, P AMER CONTR CONF, P3595, DOI [10.23919/ACC45564.2020.9147505, 10.23919/acc45564.2020.9147505]
[2]   Finite-horizon covariance control for discrete-time stochastic linear systems subject to input constraints [J].
Bakolas, Efstathios .
AUTOMATICA, 2018, 91 :61-68
[3]  
Bakolas E, 2016, IEEE DECIS CONTR P, P1153
[4]   Covariance Steering of Discrete-Time Stochastic Linear Systems Based on Wasserstein Distance Terminal Cost [J].
Balci, Isin M. ;
Bakolas, Efstathios .
IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (06) :2000-2005
[5]   On the relative entropy of discrete-time Markov processes with given end-point densities [J].
Beghi, A .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1996, 42 (05) :1529-1535
[6]  
Beghi A., 1997, J. Math. Syst. Extimation Control, V7, P343
[7]  
Ben-Israel A., 2003, Generalized inverses: theory and applications, V15
[8]  
Blaquiere A., 1992, Dynamics and Control, V2, P235, DOI 10.1007/BF02169515
[9]   Wasserstein Proximal Algorithms for the Schrodinger Bridge Problem: Density Control With Nonlinear Drift [J].
Caluya, Kenneth F. ;
Halder, Abhishek .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (03) :1163-1178
[10]   Optimal Transport in Systems and Control [J].
Chen, Yongxin ;
Georgiou, Tryphon T. ;
Pavon, Michele .
ANNUAL REVIEW OF CONTROL, ROBOTICS, AND AUTONOMOUS SYSTEMS, VOL 4, 2021, 2021, 4 :89-113