Extremum seeking for optimal control problems with unknown time-varying systems and unknown objective functions

被引:10
作者
Scheinker, Alexander [1 ]
Scheinker, David [2 ]
机构
[1] Los Alamos Natl Lab, Los Alamos, NM 87545 USA
[2] Stanford Univ, Stanford, CA 94305 USA
关键词
extremum seeking; industrial applications of optimal control; iterative schemes; optimal control; optimal controller synthesis for systems with uncertainties; time-varying systems; ITERATIVE LEARNING CONTROL; LIE BRACKET APPROXIMATION; OPTIMAL TRACKING CONTROL; ADAPTIVE OPTIMAL-CONTROL; HORIZON LQ CONTROL; LINEAR-SYSTEMS; FEEDBACK; OPTIMIZATION; INTEGRALS; DESIGN;
D O I
10.1002/acs.3097
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of optimal feedback control of an unknown, noisy, time-varying, dynamic system that is initialized repeatedly. Examples include a robotic manipulator which must perform the same motion, such as assisting a human, repeatedly and accelerating cavities in particle accelerators which are turned on for a fraction of a second with given initial conditions and vary slowly due to temperature fluctuations. We present an approach that applies to systems of practical interest. The method presented here is model independent; does not require knowledge of the objective function; is robust to measurement noise; is applicable for any set of initial conditions; is applicable to simultaneously controlling an arbitrary number of parameters; and may be implemented with a broad range of continuous or discontinuous functions such as sine or square waves. For systems with convex cost functions we prove that our algorithm will produce controllers that approach the minimal cost. For linear systems we reproduce the cost minimizing linear quadratic regulator optimal controller that could have been designed analytically had the system and cost function been known. We demonstrate the effectiveness of the algorithm with simulation studies of noisy and time-varying systems.
引用
收藏
页码:1143 / 1161
页数:19
相关论文
共 52 条
[1]   Iterative learning control: Brief survey and categorization [J].
Ahn, Hyo-Sung ;
Chen, YangQuan ;
Moore, Kevin L. .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2007, 37 (06) :1099-1121
[2]   BETTERING OPERATION OF ROBOTS BY LEARNING [J].
ARIMOTO, S ;
KAWAMURA, S ;
MIYAZAKI, F .
JOURNAL OF ROBOTIC SYSTEMS, 1984, 1 (02) :123-140
[3]   Learning-based iterative modular adaptive control for nonlinear systems [J].
Benosman, Mouhacine ;
Farahmand, Amir-Massoud ;
Xia, Meng .
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2019, 33 (02) :335-355
[4]   Multi-parametric extremum seeking-based iterative feedback gains tuning for nonlinear control [J].
Benosman, Mouhacine .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2016, 26 (18) :4035-4055
[5]   Extremum seeking-based adaptive control for electromagnetic actuators [J].
Benosman, Mouhacine ;
Atinc, Goekhan M. .
INTERNATIONAL JOURNAL OF CONTROL, 2015, 88 (03) :517-530
[6]   Control and optimization of batch processes [J].
Bonvin, Dominique ;
Srinivasan, Bala ;
Hunkeler, David .
IEEE CONTROL SYSTEMS MAGAZINE, 2006, 26 (06) :34-45
[7]   A survey of iterative learning control [J].
Bristow, Douglas A. ;
Tharayil, Marina ;
Alleyne, Andrew G. .
IEEE CONTROL SYSTEMS MAGAZINE, 2006, 26 (03) :96-114
[8]   Adaptive iterative learning control of uncertain robotic systems [J].
Choi, JY ;
Lee, JS .
IEE PROCEEDINGS-CONTROL THEORY AND APPLICATIONS, 2000, 147 (02) :217-223
[9]   Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence [J].
Dierks, Travis ;
Thumati, Balaje T. ;
Jagannathan, S. .
NEURAL NETWORKS, 2009, 22 (5-6) :851-860
[10]   Lie bracket approximation of extremum seeking systems [J].
Duerr, Hans-Bernd ;
Stankovic, Milos S. ;
Ebenbauer, Christian ;
Johansson, Karl Henrik .
AUTOMATICA, 2013, 49 (06) :1538-1552