Mixed H2/H∞-Policy Learning Synthesis

Cited by: 2

Authors:

Molu, Lekan [1]

Affiliations:

[1] Microsoft Res, 300 Lafayette St, New York, NY 10012 USA

Source:

IFAC-PapersOnLine | 2023 / Vol. 56 / No. 2

Keywords:

Robust control; Data-driven optimal control; Machine learning in modelling, prediction, control and automation

DOI:

10.1016/j.ifacol.2023.10.148

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

A robustly stabilizing optimal control policy in a model-free mixed H2/H∞-control setting is here put forward for counterbalancing the slow convergence and non-robustness of traditional high-variance policy optimization (and by extension policy gradient) algorithms. Leveraging Itô's stochastic differential calculus, we iteratively solve the system's continuous-time (closed-loop) generalized algebraic Riccati equation (GARE) whilst updating its admissible controllers in a two-player, zero-sum differential game setting. Our new results are illustrated by learning-enabled control systems which gather previously disseminated results in this field in one holistic data-driven presentation with greater simplification, improvement, and clarity.


Pages: 9116-9123

Page count: 8
