Excitation for Adaptive Optimal Control of Nonlinear Systems in Differential Games

Cited by: 6
Authors
Karg, Philipp [1]
Koepf, Florian [1]
Braun, Christian A. [1]
Hohmann, Soeren [1]
Affiliations
[1] Karlsruhe Inst Technol, Inst Control Syst, D-76131 Karlsruhe, Germany
Keywords
Adaptive dynamic programming (ADP); adaptive optimal control; persistent excitation (PE); zero-sum games; parameter convergence
DOI
10.1109/TAC.2022.3145651
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This article focuses on the fulfillment of the persistent excitation (PE) condition for signals which result from transformations by means of polynomials. This is essential, e.g., for the convergence of adaptive dynamic programming algorithms due to commonly used polynomial function approximators. As theoretical statements are scarce regarding the nonlinear transformation of PE signals, we propose conditions on the system state such that its transformation by polynomials is PE. To validate our theoretical statements, we develop an exemplary excitation procedure based on our conditions using a feed-forward control approach and demonstrate the effectiveness of our method in a nonzero-sum differential game. In this setting, our approach outperforms commonly used probing noise in terms of convergence time and the degree of PE, shown by a numerical example.
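To make the PE condition on polynomially transformed signals concrete, the following is a minimal numerical sketch, not the authors' procedure. It assumes the standard definition of PE (the windowed integral of phi(t) phi(t)^T is bounded below by a positive multiple of the identity) and approximates it by the smallest eigenvalue of a Riemann-sum Gram matrix over sliding windows. The function names `poly_features` and `pe_level`, the degree-2 monomial basis, and the two test signals are hypothetical choices made only for illustration.

```python
import numpy as np

def poly_features(x):
    """Hypothetical polynomial basis: monomials of degree 1 and 2 of a 2-D state."""
    x1, x2 = x[:, 0], x[:, 1]
    return np.stack([x1, x2, x1**2, x1 * x2, x2**2], axis=1)

def pe_level(phi, dt, window):
    """Smallest eigenvalue of the windowed Gram integral, minimised over windows."""
    n = int(round(window / dt))              # samples per window
    levels = []
    for start in range(0, phi.shape[0] - n + 1, n // 4):
        seg = phi[start:start + n]
        gram = seg.T @ seg * dt              # Riemann-sum approximation of the integral
        levels.append(np.linalg.eigvalsh(gram)[0])  # eigenvalues come in ascending order
    return min(levels)

dt = 0.01
t = np.arange(0.0, 40.0, dt)

# Assumed example states: two distinct frequencies vs. two identical components.
rich = np.stack([np.sin(t), np.sin(2.0 * t)], axis=1)
poor = np.stack([np.sin(t), np.sin(t)], axis=1)

level_rich = pe_level(poly_features(rich), dt, window=2 * np.pi)
level_poor = pe_level(poly_features(poor), dt, window=2 * np.pi)

print(f"PE level, two-frequency state: {level_rich:.3f}")
print(f"PE level, identical components: {level_poor:.2e}")
```

In this sketch, the state with two distinct frequencies yields a clearly positive level, while the state with identical components makes several monomials coincide, so the Gram matrix is rank deficient and the level is numerically zero. This only illustrates why conditions on the state are needed before its polynomial transformation can be PE; it does not reproduce the conditions proposed in the article.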
Pages: 596-603
Number of pages: 8