A Distributed Actor-Critic Learning Approach for Affine Formation Control of Multi-Robots With Unknown Dynamics

Cited by: 0

Authors:

Zhang, Ronghua [1,2]

Ma, Qingwen [1]

Zhang, Xinglong [1]

Xu, Xin [1]

Liu, Daxue [1]

Affiliations:

[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha, Peoples R China

[2] Sichuan Univ Sci & Engn, Sch Mech Engn, Zigong, Peoples R China

Source:

INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING | 2025

Keywords:

affine formation control; data-driven; multi-robots; reinforcement learning; rollout; TIME NONLINEAR-SYSTEMS

DOI:

10.1002/acs.3972

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Formation maneuverability is particularly important for multi-robots (MRs), especially when the robots operate cooperatively in complex and dynamic environments. Although various methods have been developed for affine formation, designing an affine formation controller for MRs with unknown dynamics remains difficult. This paper proposes a distributed actor-critic learning (DACL) approach with look-ahead rollout for the affine formation of MRs under local communication, which improves online learning efficiency. In the proposed approach, a distributed data-driven online optimization mechanism is designed via the sparse kernel technique to solve the near-optimal affine formation control problem for MRs with unknown dynamics and to improve control performance. The unknown dynamics of the MRs are learned offline from precollected input-output datasets, and the sparse kernel-based approach is employed to increase the feature representation capability of the samples. The distributed online actor-critic algorithm for each robot in the formation then uses two neural networks, which approximate the costate functions and the near-optimal policies. A convergence analysis of the proposed approach is also provided. Finally, numerical simulations and KKSwarm-based experiments verify the effectiveness of the proposed approach.
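The abstract does not define the affine formation objective, so the following is only a minimal sketch of the definition commonly used in the affine formation literature: each robot's target position is an affine image p_i = A r_i + b of its nominal position r_i, and in d dimensions the pair (A, b) is fixed by d + 1 affinely independent leaders. The function names, the square nominal configuration, and the chosen transform are all hypothetical illustrations. This is not the paper's DACL controller, and it involves no learning or robot dynamics.

```python
import numpy as np

def affine_formation(nominal, A, b):
    """Map nominal positions r_i (one per row) to targets p_i = A @ r_i + b."""
    return nominal @ A.T + b

def recover_affine(nominal_leaders, leader_positions):
    """Recover (A, b) from d + 1 affinely independent leaders via least squares."""
    n = nominal_leaders.shape[0]
    # Each row is [r_i^T, 1], so that X @ [A^T; b^T] = leader_positions.
    X = np.hstack([nominal_leaders, np.ones((n, 1))])
    sol, *_ = np.linalg.lstsq(X, leader_positions, rcond=None)
    A = sol[:-1].T
    b = sol[-1]
    return A, b

# Hypothetical nominal configuration: four robots on a unit square (d = 2).
nominal = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])

# Hypothetical maneuver: rotate by 90 degrees, scale by 2, then translate.
theta = np.pi / 2
A = 2.0 * np.array([[np.cos(theta), -np.sin(theta)],
                    [np.sin(theta),  np.cos(theta)]])
b = np.array([3.0, -1.0])

targets = affine_formation(nominal, A, b)

# The first three robots act as leaders; the fourth robot's target
# follows from the transform recovered from the leaders alone.
A_hat, b_hat = recover_affine(nominal[:3], targets[:3])
follower_target = affine_formation(nominal[3:], A_hat, b_hat)
```

Under these assumptions, rotation, scaling, shearing, and translation of the whole formation are all expressed through the single pair (A, b), which is why affine formations are associated with maneuverability.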


Pages: 15

50 items in total

[1] Multi-actor mechanism for actor-critic reinforcement learning
Li, Lin
Li, Yuze
Wei, Wei
Zhang, Yujia
Liang, Jiye
INFORMATION SCIENCES, 2023, 647
[2] USING ACTOR-CRITIC REINFORCEMENT LEARNING FOR CONTROL AND FLIGHT FORMATION OF QUADROTORS
Torres, Edgar
Xu, Lei
Sardarmehni, Tohid
PROCEEDINGS OF ASME 2022 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2022, VOL 5, 2022
[3] MULTI-STEP ACTOR-CRITIC FRAMEWORK FOR REINFORCEMENT LEARNING IN CONTINUOUS CONTROL
Huang T.
Chen G.
Journal of Applied and Numerical Optimization, 2023, 5 (02): 189-200
[4] On the Role of Models in Learning Control: Actor-Critic Iterative Learning Control
Poot, Maurice
Portegies, Jim
Oomen, Tom
IFAC PAPERSONLINE, 2020, 53 (02): 1450-1455
[5] Actor-Critic Reinforcement Learning for Control With Stability Guarantee
Han, Minghao
Zhang, Lixian
Wang, Jun
Pan, Wei
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04): 6217-6224
[6] A Novel Actor-Critic Motor Reinforcement Learning for Continuum Soft Robots
Pantoja-Garcia, Luis
Parra-Vega, Vicente
Garcia-Rodriguez, Rodolfo
Vazquez-Garcia, Carlos Ernesto
ROBOTICS, 2023, 12 (05)
[7] Learning Locomotion for Quadruped Robots via Distributional Ensemble Actor-Critic
Li, Sicen
Pang, Yiming
Bai, Panju
Li, Jiawei
Liu, Zhaojin
Hu, Shihao
Wang, Liquan
Wang, Gang
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (02): 1811-1818
[8] Model-Based Actor-Critic Learning for Optimal Tracking Control of Robots With Input Saturation
Zhao, Xingwei
Tao, Bo
Qian, Lu
Ding, Han
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, 68 (06): 5046-5056
[9] Actor-critic learning based PID control for robotic manipulators
Nohooji, Hamed Rahimi
Zaraki, Abolfazl
Voos, Holger
APPLIED SOFT COMPUTING, 2024, 151
[10] An Object Oriented Approach to Fuzzy Actor-Critic Learning for Multi-Agent Differential Games
Schwartz, Howard
2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019: 183-190
