An Online Single-Network Adaptive Algorithm for Continuous-Time Nonlinear Optimal Control

被引:0
作者
Lee, Jae Young [1 ]
Park, Jin Bae [1 ]
Choi, Yoon Ho [2 ]
Lee, Keun Uk [1 ]
机构
[1] Yonsei Univ, Dept Elect & Elect Engn, Seoul 120749, South Korea
[2] Kyonggi Univ, Dept Elect Engn, Kyonggi Do, South Korea
来源
2013 13TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2013) | 2013年
基金
新加坡国家研究基金会;
关键词
Adaptive control; optimal control; actor-critic; approximate dynamic programming; nonlinear control; SYSTEMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose an online adaptive neural-algorithm to solve the CT nonlinear optimal control problems. Compared to the existing methods, which adopt the architecture with two neural networks (NNs) for actor-critic implementations, only one NN for critic is used to implement the algorithm, simplifying the structure of the computation model. Moreover, we also provide a generalized learning rule for updating the NN weights, which covers the existing critic update rules as special cases. The theoretical and numerical results are given under the required persistent excitation condition to verify and analyze stability and performance of the proposed method.
引用
收藏
页码:1687 / 1690
页数:4
相关论文
共 14 条
  • [1] Bounded robust control of nonlinear systems using neural network-based HJB solution
    Adhyaru, Dipak M.
    Kar, I. N.
    Gopal, M.
    [J]. NEURAL COMPUTING & APPLICATIONS, 2011, 20 (01) : 91 - 103
  • [2] [Anonymous], 2004, IEEE T AUTOMAT CONTR, DOI DOI 10.1109/TAC.1972.1100008
  • [3] Bhasin S., 2012, AUTOMATICA, V49
  • [4] Continuous-time adaptive critics
    Hanselmann, Thomas
    Noakes, Lyle
    Zaknich, Anthony
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2007, 18 (03): : 631 - 647
  • [5] Lewis F., 1995, Optimal control
  • [6] Munos Remi., 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339), V3, P2152, DOI [10.1109/IJCNN.1999.832721, DOI 10.1109/IJCNN.1999.832721]
  • [7] A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems
    Padhi, Radhakant
    Unnikrishnan, Nishant
    Wang, Xiaohua
    Balakrishnan, S. N.
    [J]. NEURAL NETWORKS, 2006, 19 (10) : 1648 - 1660
  • [8] Adaptive critic designs
    Prokhorov, DV
    Wunsch, DC
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (05): : 997 - 1007
  • [9] Sastry S., 1989, Adaptive Control: Stability Convergence and Robustness
  • [10] Si J., 2004, HDB LEARNING APPROXI