Mixed H2/H∞- Policy Learning Synthesis

被引:2
作者
Molu, Lekan [1 ]
机构
[1] Microsoft Res, 300 Lafayette St, New York, NY 10012 USA
来源
IFAC PAPERSONLINE | 2023年 / 56卷 / 02期
关键词
Robust control; Data-driven optimal control; Machine learning in modelling; prediction; control and automation.prediction; control and automation;
D O I
10.1016/j.ifacol.2023.10.148
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A robustly stabilizing optimal control policy in a model-free mixed H2/H8-control setting is here put forward for counterbalancing the slow convergence and non-robustness of traditional high-variance policy optimization (and by extension policy gradient) algorithms. Leveraging It<^>o's stochastic differential calculus, we iteratively solve the system's continuoustime (closed-loop) generalized algebraic Riccati equation(GARE) whilst updating its admissible controllers in a two-player, zero-sum differential game setting. Our new results are illustrated by learning-enabled control systems which gather previously disseminated results in this field in one holistic data-driven presentation with greater simplification, improvement, and clarity.
引用
收藏
页码:9116 / 9123
页数:8
相关论文
共 22 条