Optimal synchronized control of nonlinear coupled harmonic oscillators based on actor-critic reinforcement learning

Cited by: 3

Authors:

Gu, Zhiyang ^{[1]}

Fan, Chengli ^{[2]}

Yu, Dengxiu ^{[3]}

Wang, Zhen ^{[4]}

Affiliations:

[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Shaanxi, Peoples R China

[2] Air Force Engn Univ, Air & Missile Def Coll, Xian, Shaanxi, Peoples R China

[3] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Shaanxi, Peoples R China

[4] Northwestern Polytech Univ, Ctr Opt Imagery Anal & Learning, Xian 710072, Shaanxi, Peoples R China

Source:

NONLINEAR DYNAMICS | 2023 / Vol. 111 / Issue 22

Funding:

National Natural Science Foundation of China; China Postdoctoral Science Foundation;

Keywords:

Coupled harmonic oscillator; Reinforcement learning; Backstepping control; Synchronization; Nonlinear dynamics; SYSTEMS; TRANSITION;

DOI:

10.1007/s11071-023-08957-y

Chinese Library Classification (CLC) number:

TH [Machinery and Instrument Industry];

Discipline classification code:

0802 ;

Abstract:

A distributed optimal control algorithm based on adaptive neural networks is proposed for the synchronization control problem of a class of second-order nonlinear coupled harmonic oscillators. First, graph theory is used to establish the coupling relationships among the harmonic oscillator models. Second, a neural network is used to approximate the unknown nonlinearity in the harmonic oscillator model, and the virtual controller and the actual controller are designed using the backstepping method. Then, the cost function and the Hamilton-Jacobi-Bellman (HJB) equation are formulated from the state error and the controller. Since the HJB equation cannot be solved directly, a critic neural network approximates its solution. These two neural networks constitute a simplified reinforcement learning scheme that achieves optimal synchronized control of the nonlinear coupled harmonic oscillators. Finally, the stability of the scheme is verified by the Lyapunov stability theorem, and its effectiveness is verified by numerical simulation.
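The graph-based coupling mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's controller: it leaves out the unknown nonlinearity, the neural networks, the backstepping design, and the HJB-based optimization. It assumes the classical linear coupled harmonic oscillator model with velocity coupling through a graph Laplacian, and the 3-node path graph, the function name `simulate`, and all parameter values are hypothetical choices made only for illustration.

```python
import numpy as np


def simulate(adjacency, x0, v0, alpha=1.0, dt=0.001, steps=40000):
    """Integrate x_i' = v_i, v_i' = -alpha * x_i - sum_j a_ij * (v_i - v_j).

    Uses a semi-implicit Euler step (velocity first, then position).
    This linear model is an assumption for illustration, not the paper's system.
    """
    a = np.asarray(adjacency, dtype=float)
    laplacian = np.diag(a.sum(axis=1)) - a  # graph Laplacian L = D - A
    x = np.asarray(x0, dtype=float).copy()
    v = np.asarray(v0, dtype=float).copy()
    for _ in range(steps):
        v = v + dt * (-alpha * x - laplacian @ v)
        x = x + dt * v
    return x, v


# Hypothetical undirected path graph 1 - 2 - 3 (connected).
ADJ = [[0, 1, 0],
       [1, 0, 1],
       [0, 1, 0]]

x_final, v_final = simulate(ADJ, x0=[1.0, -0.5, 0.3], v0=[0.0, 0.2, -0.1])

# Spread between the largest and smallest state: small values mean the
# oscillators have synchronized onto a common oscillation.
x_spread = float(np.ptp(x_final))
v_spread = float(np.ptp(v_final))
print("position spread:", x_spread)
print("velocity spread:", v_spread)
```

On a connected undirected graph the disagreement between oscillators is damped by the Laplacian term while the common oscillation persists, so both spreads shrink toward zero as the simulation runs.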


Pages: 21051 / 21064

Number of pages: 14

