A Distributed Actor-Critic Learning Approach for Affine Formation Control of Multi-Robots With Unknown Dynamics

被引：1

作者：

Zhang, Ronghua ^{[1
,2
]}

Ma, Qingwen ^{[1
]}

Zhang, Xinglong ^{[1
]}

Xu, Xin ^{[1
]}

Liu, Daxue ^{[1
]}

机构：

[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha, Peoples R China

[2] Sichuan Univ Sci & Engn, Sch Mech Engn, Zigong, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING | 2025年

关键词：

affine formation control; data-driven; multi-robots; reinforcement learning; rollout; TIME NONLINEAR-SYSTEMS;

D O I：

10.1002/acs.3972

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Formation maneuverability is particularly important for multi-robots (MRs), especially when the robots are operating cooperatively in complex and dynamic environments. Although various methods have been developed for affine formation, it is still a difficult problem to design an affine formation controller for MRs with unknown dynamics. In this paper, a distributed actor-critic learning approach (DACL) in a look-ahead rollout manner is proposed for the affine formation of MRs under local communication, which improves the online learning efficiency. In the proposed approach, a distributed data-driven online optimization mechanism is designed via the sparse kernel technique to solve the near-optimal affine formation control issue of MRs with unknown dynamics as well as improve control performance. The unknown dynamics of MRs are learned offline based on precollected input-output datasets, and the sparse kernel-based approach is employed to increase the feature representation capability of the samples. Then, the proposed distributed online actor-critic algorithm for each robot in the formation includes two neural networks, which are utilized to approximate the costate functions and the near-optimal policies. Moreover, the convergence analysis of the proposed approach has been conducted. Finally, numerical simulation and KKSwarm-based experiment studies are performed to verify the effectiveness of the proposed approach.

引用

页码：803 / 817

页数：15

共 50 条

[41] Optimal synchronized control of nonlinear coupled harmonic oscillators based on actor-critic reinforcement learning
Gu, Zhiyang
Fan, Chengli
Yu, Dengxiu
Wang, Zhen
NONLINEAR DYNAMICS, 2023, 111 (22) : 21051 - 21064
[42] Multiagent Soft Actor-Critic Learning for Distributed ESS Enabled Robust Voltage Regulation of Active Distribution Grids
Chen, Yongdong
Liu, Youbo
Yin, Hang
Tang, Zhiyuan
Qiu, Gao
Liu, Junyong
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (09) : 11069 - 11080
[43] Neuromuscular control of sagittal arm during repetitive movement by Actor-Critic reinforcement learning method
Golkhou, V
Lucas, C
Parnianpour, M
Intelligent Automations and Control: Trends Principles, and Applications, Vol 16, 2004, 16 : 371 - 376
[44] Scalable Online Disease Diagnosis via Multi-Model-Fused Actor-Critic Reinforcement Learning
He, Weijie
Chen, Ting
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4695 - 4703
[45] Advancements in UAV Path Planning: A Deep Reinforcement Learning Approach with Soft Actor-Critic for Enhanced Navigation
Guo, Jingrui
Zhou, Guanzhong
Huang, Hailong
Huang, Chao
UNMANNED SYSTEMS, 2024,
[46] REINFORCEMENT LEARNING-BASED ADAPTIVE MOTION CONTROL FOR AUTONOMOUS VEHICLES VIA ACTOR-CRITIC STRUCTURE
Wang, Honghai
Wei, Liangfen
Wang, Xianchao
He, Shuping
DISCRETE AND CONTINUOUS DYNAMICAL SYSTEMS-SERIES S, 2024, 17 (09): : 2894 - 2911
[47] Entropy regularized actor-critic based multi-agent deep reinforcement learning for stochastic games
Hao, Dong
Zhang, Dongcheng
Shi, Qi
Li, Kai
Information Sciences, 2022, 617 : 17 - 40
[48] Slice admission control in 5G wireless communication with multi-dimensional state space and distributed action space: A sequential twin actor-critic approach
Ojijo, Mourice Otieno
Ramotsoela, Daniel
Oginga, Ruth A.
COMPUTER NETWORKS, 2024, 255
[49] Formation control of multi-robots for on-orbit assembly of large solar sails
Hu, Quan
Zhang, Yao
Zhang, Jingrui
Hu, Haiyan
ACTA ASTRONAUTICA, 2016, 123 : 446 - 454
[50] Optimized distributed formation control using identifier-critic-actor reinforcement learning for a class of stochastic nonlinear multi-agent systems☆
Wen, Guoxing
Niu, Ben
ISA TRANSACTIONS, 2024, 155 : 1 - 10

← 1 2 3 4 5 →