Kernel-based Consensus Control of Multi-agent Systems with Unknown System Dynamics

Cited by: 0
Authors
Wei Wang
Changyang Feng
Affiliations
[1] Zhongnan University of Economics and Law, School of Information and Safety Engineering
[2] Central China Normal University, School of Information Management
Source
International Journal of Control, Automation and Systems | 2023 / Vol. 21
Keywords
Adaptive dynamic programming; consensus control; kernel-based method; multi-agent systems; optimal control
DOI
Not available
Abstract
This paper develops a novel method for optimal consensus control of multi-agent systems (MASs) based on adaptive dynamic programming (ADP). Unlike neural networks (NNs), which require manually designed features for value function approximation and may therefore affect the approximation quality, kernel-based methods approximate value functions without predefining the model structure. Moreover, to handle unknown or complex system dynamics, a local action-value function is defined and approximated with kernel-based methods. The result is an action-dependent heuristic dynamic programming (ADHDP) approach that uses kernel-based local action-value function approximation to achieve model-free optimal consensus control. The developed approach learns the system dynamics from historical data and avoids the need for system identification. Its effectiveness is demonstrated with two simulation examples.
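To make the idea of kernel-based action-value approximation more concrete, the following is a minimal sketch, not the authors' ADHDP algorithm. It assumes a Gaussian kernel, plain kernel ridge regression, a scalar local consensus error `e`, a scalar control `u`, and a hypothetical quadratic cost `toy_q` standing in for recorded data. All function names and parameter values are illustrative assumptions.

```python
import numpy as np


def gaussian_kernel(A, B, width=1.0):
    """Gaussian (RBF) kernel matrix between the rows of A and the rows of B."""
    sq_dist = ((A[:, None, :] - B[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-sq_dist / (2.0 * width ** 2))


def fit_kernel_q(Z, targets, width=1.0, ridge=1e-6):
    """Kernel ridge regression: solve (K + ridge * I) alpha = targets."""
    K = gaussian_kernel(Z, Z, width)
    return np.linalg.solve(K + ridge * np.eye(len(Z)), targets)


def predict_q(Z_train, alpha, Z_query, width=1.0):
    """Evaluate the fitted action-value function at the query (e, u) pairs."""
    return gaussian_kernel(Z_query, Z_train, width) @ alpha


def toy_q(Z):
    """Hypothetical action-value used only to generate illustrative targets."""
    e, u = Z[:, 0], Z[:, 1]
    return e ** 2 + 0.5 * u ** 2 + (e + u) ** 2


# "Historical data": sampled (local error, control) pairs with their values.
rng = np.random.default_rng(0)
Z_train = rng.uniform(-1.0, 1.0, size=(200, 2))
alpha = fit_kernel_q(Z_train, toy_q(Z_train))

# Greedy control for a given local error: minimize the approximated
# action-value over a grid of candidate controls. No system model is used.
e = 0.4
candidates = np.linspace(-1.0, 1.0, 201)
Z_query = np.column_stack([np.full_like(candidates, e), candidates])
u_greedy = candidates[np.argmin(predict_q(Z_train, alpha, Z_query))]
print("greedy control:", u_greedy)
```

No feature vector is designed by hand here: the approximator is built directly from the stored samples, which is the property the abstract contrasts with NN-based approximation. For this toy cost the exact minimizer is `u = -2e/3`, so the printed value can be checked against it.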
Pages: 2398-2408
Page count: 10