Online adaptive critic flight control

被引:108
作者
Ferrari, S [1 ]
Stengel, RF
机构
[1] Duke Univ, Durham, NC 27707 USA
[2] Princeton Univ, Princeton, NJ 08544 USA
关键词
D O I
10.2514/1.12597
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
A nonlinear control system comprising a network of networks is taught by the use of a two-phase learning procedure realized through novel training techniques and an adaptive critic design. The neural network controller is trained algebraically, offline, by the observation that its gradients must equal corresponding linear gain matrices at chosen operating points. Online learning by a dual heuristic adaptive critic architecture optimizes performance incrementally over time by accounting for plant dynamics and nonlinear effects that are revealed during large, coupled motions. The method is implemented to control the six-degree-of-freedom simulation of a business jet aircraft over its full operating envelope. The result is a controller that improves its performance while unexpected conditions, such as unmodeled dynamics, parameter variations, and control failures, are experienced for the first time.
引用
收藏
页码:777 / 786
页数:10
相关论文
共 31 条
[1]  
ASTROM KJ, 1972, COMMUN ACM, V15, P820
[2]   Adaptive-critic-based neural networks for aircraft optimal control [J].
Balakrishnan, SN ;
Biega, V .
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1996, 19 (04) :893-898
[3]   UNIVERSAL APPROXIMATION BOUNDS FOR SUPERPOSITIONS OF A SIGMOIDAL FUNCTION [J].
BARRON, AR .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1993, 39 (03) :930-945
[4]  
Bellman R., 1957, DYNAMIC PROGRAMMING
[5]  
Etkin B., 1972, DYNAMICS ATMOSPHERIC
[6]  
Ferrari S, 2001, P AMER CONTR CONF, P1605, DOI 10.1109/ACC.2001.945956
[7]   Classical/neural synthesis of nonlinear control systems [J].
Ferrari, S ;
Stengel, RF .
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2002, 25 (03) :442-448
[8]  
FERRARI S, IN PRESS LEARNING AP
[9]  
Ferrari S., 2002, THESIS PRINCETON U P
[10]  
FRIEDLAND B, 1996, CONTROL HDB, P607