Static Output-Feedback ${H}_{∞}$ Control Design Procedures for Continuous-Time Systems With Different Levels of Model Knowledge

被引:14
作者
Arogeti, Shai A. [1 ]
Lewis, Frank L. [2 ]
机构
[1] Ben Gurion Univ Negev, Dept Mech Engn, IL-84105 Beer Sheva, Israel
[2] Univ Texas Arlington, UTA Res Inst, Ft Worth, TX 76118 USA
基金
美国国家科学基金会;
关键词
Control design; Optimal control; Linear systems; Standards; Riccati equations; Convergence; Attenuation; H∞ optimal control; off-policy reinforcement learning (RL); static output feedback; POLICY UPDATE ALGORITHMS; LINEAR-SYSTEMS; TRACKING CONTROL; LEARNING CONTROL;
D O I
10.1109/TCYB.2021.3103148
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article suggests a collection of model-based and model-free output-feedback optimal solutions to a general ${H_{infinity}}$ control design criterion of a continuous-time linear system. The goal is to obtain a static output-feedback controller while the design criterion is formulated with an exponential term, divergent or convergent, depending on the designer's choice. Two offline policy-iteration algorithms are presented first, which form the foundations for a family of online off-policy designs. These algorithms cover all different cases of partial or complete model knowledge and provide the designer with a collection of design alternatives. It is shown that such a design for partial model knowledge can reduce the number of unknown matrices to be solved online. In particular, if the disturbance input matrix of the model is given, off-policy learning can be done with no disturbance excitation. This alternative is useful in situations where a measurable disturbance is not available in the learning phase. The utility of these design procedures is demonstrated for the case of an optimal lane tracking controller of an automated car.
引用
收藏
页码:1432 / 1446
页数:15
相关论文
共 42 条
[1]   A stable neural network-based observer with application to flexible-joint manipulators [J].
Abdollahi, F ;
Talebi, HA ;
Patel, RV .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2006, 17 (01) :118-129
[2]   Adaptive critic designs for discrete-time zero-sum games with application to H∞ control [J].
Al-Tamimi, Asma ;
Abu-Khalaf, Murad ;
Lewis, Frank L. .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (01) :240-247
[3]  
Anderson B., 1971, LINEAR OPTIMAL CONTR
[4]   Path Following of Autonomous Vehicles in the Presence of Sliding Effects [J].
Arogeti, Shai A. ;
Berman, Nadav .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2012, 61 (04) :1481-1492
[5]   Issues on stability of ADP feedback controllers for dynamical systems [J].
Balakrishnan, S. N. ;
Ding, Jie ;
Lewis, Frank L. .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04) :913-917
[6]   NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].
BARTO, AG ;
SUTTON, RS ;
ANDERSON, CW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846
[7]  
Beard R., 1995, Improving the Closed-Loop Performance of Nonlinear Systems
[8]   STATE-SPACE SOLUTIONS TO STANDARD H-2 AND H-INFINITY CONTROL-PROBLEMS [J].
DOYLE, JC ;
GLOVER, K ;
KHARGONEKAR, PP ;
FRANCIS, BA .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1989, 34 (08) :831-847
[9]  
Flockerzi D, 1993, TOPICS CONTROL THEOR, P43
[10]   Necessary and sufficient conditions for H-∞ static output-feedback control [J].
Gadewadikar, Jyotirmay ;
Lewis, Frank L. ;
Abu-Khalaf, Murad .
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2006, 29 (04) :915-920