A Distributed Actor-Critic Learning Approach for Affine Formation Control of Multi-Robots With Unknown Dynamics

被引:1
|
作者
Zhang, Ronghua [1 ,2 ]
Ma, Qingwen [1 ]
Zhang, Xinglong [1 ]
Xu, Xin [1 ]
Liu, Daxue [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha, Peoples R China
[2] Sichuan Univ Sci & Engn, Sch Mech Engn, Zigong, Peoples R China
关键词
affine formation control; data-driven; multi-robots; reinforcement learning; rollout; TIME NONLINEAR-SYSTEMS;
D O I
10.1002/acs.3972
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Formation maneuverability is particularly important for multi-robots (MRs), especially when the robots are operating cooperatively in complex and dynamic environments. Although various methods have been developed for affine formation, it is still a difficult problem to design an affine formation controller for MRs with unknown dynamics. In this paper, a distributed actor-critic learning approach (DACL) in a look-ahead rollout manner is proposed for the affine formation of MRs under local communication, which improves the online learning efficiency. In the proposed approach, a distributed data-driven online optimization mechanism is designed via the sparse kernel technique to solve the near-optimal affine formation control issue of MRs with unknown dynamics as well as improve control performance. The unknown dynamics of MRs are learned offline based on precollected input-output datasets, and the sparse kernel-based approach is employed to increase the feature representation capability of the samples. Then, the proposed distributed online actor-critic algorithm for each robot in the formation includes two neural networks, which are utilized to approximate the costate functions and the near-optimal policies. Moreover, the convergence analysis of the proposed approach has been conducted. Finally, numerical simulation and KKSwarm-based experiment studies are performed to verify the effectiveness of the proposed approach.
引用
收藏
页码:803 / 817
页数:15
相关论文
共 50 条
  • [31] Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control With Action Constraints
    Kasaura, Kazumi
    Miura, Shuwa
    Kozuno, Tadashi
    Yonetani, Ryo
    Hoshino, Kenta
    Hosoe, Yohei
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (08) : 4449 - 4456
  • [32] Autonomous Control of Lift System based on Actor-Critic Learning for Air Cushion Vehicle
    Zhou, Hua
    Wang, Yuanhui
    Jiyang, E.
    Wang, Xiaole
    OCEANS 2023 - LIMERICK, 2023,
  • [33] Actor-Critic Learning Algorithms for Mean-Field Control with Moment Neural Networks
    Pham, Huyen
    Warin, Xavier
    METHODOLOGY AND COMPUTING IN APPLIED PROBABILITY, 2025, 27 (01)
  • [34] An improved neuro-dynamics-based approach to online path planning for multi-robots in unknown dynamic environments
    Yi, Xin
    Zhu, Anmin
    2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2013, : 1 - 6
  • [35] An actor-critic algorithm for multi-agent learning in queue-based stochastic games
    Sundar, D. Krishna
    Ravikumar, K.
    NEUROCOMPUTING, 2014, 127 : 258 - 265
  • [36] Actor-critic multi-objective reinforcement learning for non-linear utility functions
    Reymond, Mathieu
    Hayes, Conor F.
    Steckelmacher, Denis
    Roijers, Diederik M.
    Nowe, Ann
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2023, 37 (02)
  • [37] Actor-critic multi-objective reinforcement learning for non-linear utility functions
    Mathieu Reymond
    Conor F. Hayes
    Denis Steckelmacher
    Diederik M. Roijers
    Ann Nowé
    Autonomous Agents and Multi-Agent Systems, 2023, 37
  • [38] Development and Validation of Active Roll Control based on Actor-critic Neural Network Reinforcement Learning
    Bahr, Matthias
    Reicherts, Sebastian
    Sieberg, Philipp
    Morss, Luca
    Schramm, Dieter
    SIMULTECH: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON SIMULATION AND MODELING METHODOLOGIES, TECHNOLOGIES AND APPLICATIONS, 2019, 2019, : 36 - 46
  • [39] The True Online Continuous Learning Automation (TOCLA) in a continuous control benchmarking of actor-critic algorithms
    Frost, Gordon
    Vallejo, Marta
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 266 - 275
  • [40] Adaptive Optimal Tracking Control of an Underactuated Surface Vessel Using Actor-Critic Reinforcement Learning
    Chen, Lin
    Dai, Shi-Lu
    Dong, Chao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (06) : 7520 - 7533