Kernel-based Consensus Control of Multi-agent Systems with Unknown System Dynamics

被引:0
作者
Wei Wang
Changyang Feng
机构
[1] Zhongnan University of Economics and Law,School of Information and Safety Engineering
[2] Central China Normal University,School of Information Management
来源
International Journal of Control, Automation and Systems | 2023年 / 21卷
关键词
Adaptive dynamic programming; consensus control; kernel-based method; multi-agent systems; optimal control;
D O I
暂无
中图分类号
学科分类号
摘要
A novel method for optimal consensus control of multi-agent systems (MASs) based on adaptive dynamic programming (ADP) is developed in this paper. Unlike neural networks (NNs) that require manually designed features in value function approximation and may effect the approximation quality. Kernel-based methods are adopted to approximate value functions without predefining the model structure. Moreover, to handle the challenge of unknown or complex system dynamics, a local action-value function is defined and kernel-based methods are used to approximate the local action-value function. Thus, an action dependent heuristic dynamic programming (ADHDP) approach that uses kernel-based local action-value function approximation to achieve the model-free optimal consensus control is developed in this paper. The developed approach learns the system dynamics from historical data and avoids the need for system identification. The effectiveness of the developed approach is demonstrated with two simulation examples.
引用
收藏
页码:2398 / 2408
页数:10
相关论文
共 87 条
[1]  
Wang M(2021)Leader-following formation control of second-order nonlinear systems with time-varying communication delay International Journal of Control, Automation, and Systems 19 1729-1739
[2]  
Zhang T(2014)Analysis of flocking of cooperative multiple inertial agents via a geometric decomposition technique IEEE Transactions on Systems, Man, and Cybernetics: Systems 44 1611-1623
[3]  
Li W(2021)Anti-collision and obstacle avoidance of mobile sensor-plus-actuator networks over distributed parameter systems with time-varying delay International Journal of Control, Automation, and Systems 19 2373-2384
[4]  
Spong M W(2017)Multiagent framework for smart grids recovery IEEE Transactions on Systems, Man, and Cybernetics: Systems 47 1284-1300
[5]  
Fu H(2004)Consensus problems in networks of agents with switching topology and time-delays IEEE Transactions on Automatic Control 49 1520-1533
[6]  
Cui B(2007)Consensus and cooperation in networked multi-agent systems Proceedings of the IEEE 95 215-233
[7]  
Zhuang B(2014)Coordination for linear multiagent systems with dynamic interaction topology in the leader-following framework IEEE Transactions on Industrial Electronics 61 241-2422
[8]  
Zhang J(2011)Multi-player nonzero-sum games: online adaptive learning solution of coupled Hamilton-Jacobi equations Automatica 47 1556-1569
[9]  
Meskina S B(2019)Data-driven adaptive dynamic programming for partially observable nonzero-sum games via Q-learning method International Journal of Systems Science 50 1338-1352
[10]  
Doggaz N(2020)Model-free adaptive optimal control of episodic fixed-horizon manufacturing processes using reinforcement learning International Journal of Control, Automation, and Systems 18 1593-1604