Optimal synchronized control of nonlinear coupled harmonic oscillators based on actor-critic reinforcement learning

被引:3
作者
Gu, Zhiyang [1 ]
Fan, Chengli [2 ]
Yu, Dengxiu [3 ]
Wang, Zhen [4 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Shaanxi, Peoples R China
[2] Air Force Engn Univ, Air & Missile Def Coll, Xian, Shaanxi, Peoples R China
[3] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Shaanxi, Peoples R China
[4] Northwestern Polytech Univ, Ctr Opt Imagery Anal & Learning, Xian 710072, Shaanxi, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Coupled harmonic oscillator; Reinforcement learning; Backstepping control; Synchronization; Nonlinear dynamics; SYSTEMS; TRANSITION;
D O I
10.1007/s11071-023-08957-y
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
A distributed optimal control algorithm based on adaptive neural network is proposed for the synchronized control problem of a class of second-order nonlinear coupled harmonic oscillators. Firstly, the graph theory is used to establish the coupling relationship between the harmonic oscillator models; secondly, the neural network is used to fit the unknown nonlinearity in the harmonic oscillator model, and the virtual controller and the actual controller are designed based on the backstepping method; then, according to the state error and the controller, the cost function and the HJB function are designed. Since the HJB function cannot be solved directly, the critic neural network approximates its solution. The above two neural networks constitute a simplified reinforcement learning to achieve optimal consistent control of nonlinear coupled harmonic oscillators. Finally, the stability and effectiveness of the scheme are verified by the Lyapunov stability theorem and numerical simulation, respectively.
引用
收藏
页码:21051 / 21064
页数:14
相关论文
共 40 条
  • [1] Distributed discrete-time coupled harmonic oscillators with application to synchronised motion coordination
    Ballard, L.
    Cao, Y.
    Ren, W.
    [J]. IET CONTROL THEORY AND APPLICATIONS, 2010, 4 (05) : 806 - 816
  • [2] Adaptive Neural Network Control of AUVs With Control Input Nonlinearities Using Reinforcement Learning
    Cui, Rongxin
    Yang, Chenguang
    Li, Yang
    Sharma, Sanjay
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 47 (06): : 1019 - 1029
  • [3] Exponential synchronization of nonlinearly coupled complex networks with hybrid time-varying delays via impulsive control
    Feng, Jianwen
    Yu, Fenfen
    Zhao, Yi
    [J]. NONLINEAR DYNAMICS, 2016, 85 (01) : 621 - 632
  • [4] Neural network-based nonlinear sliding-mode control for an AUV without velocity measurements
    Guo, Xinxin
    Yan, Weisheng
    Cui, Rongxin
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2019, 92 (03) : 677 - 692
  • [5] Swarm control for large-scale omnidirectional mobile robots within incremental behavior
    Jin, Xiaoyue
    Wang, Zhen
    Zhao, Junsheng
    Yu, Dengxiu
    [J]. INFORMATION SCIENCES, 2022, 614 : 35 - 50
  • [6] Kennedy JOS., 1986, Introduction to dynamic programming, in Dynamic Programming, P27
  • [7] Time-Varying Optimal Formation Control for Second-Order Multiagent Systems Based on Neural Network Observer and Reinforcement Learning
    Lan, Jie
    Liu, Yan-Jun
    Yu, Dengxiu
    Wen, Guoxing
    Tong, Shaocheng
    Liu, Lei
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3144 - 3155
  • [8] Optimal synchronization control for multi-agent systems with input saturation: a nonzero-sum game
    Li, Hongyang
    Wei, Qinglai
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2022, 23 (07) : 1010 - 1019
  • [9] Observer-Based Neuro-Adaptive Optimized Control of Strict-Feedback Nonlinear Systems With State Constraints
    Li, Yongming
    Liu, Yanjun
    Tong, Shaocheng
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (07) : 3131 - 3145
  • [10] Optimal output synchronization of heterogeneous multi-agent systems using measured input-output data
    Liu, Yingying
    Wang, Zhanshan
    [J]. INFORMATION SCIENCES, 2022, 582 : 462 - 479