A Distributed Actor-Critic Learning Approach for Affine Formation Control of Multi-Robots With Unknown Dynamics

被引:1
|
作者
Zhang, Ronghua [1 ,2 ]
Ma, Qingwen [1 ]
Zhang, Xinglong [1 ]
Xu, Xin [1 ]
Liu, Daxue [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha, Peoples R China
[2] Sichuan Univ Sci & Engn, Sch Mech Engn, Zigong, Peoples R China
关键词
affine formation control; data-driven; multi-robots; reinforcement learning; rollout; TIME NONLINEAR-SYSTEMS;
D O I
10.1002/acs.3972
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Formation maneuverability is particularly important for multi-robots (MRs), especially when the robots are operating cooperatively in complex and dynamic environments. Although various methods have been developed for affine formation, it is still a difficult problem to design an affine formation controller for MRs with unknown dynamics. In this paper, a distributed actor-critic learning approach (DACL) in a look-ahead rollout manner is proposed for the affine formation of MRs under local communication, which improves the online learning efficiency. In the proposed approach, a distributed data-driven online optimization mechanism is designed via the sparse kernel technique to solve the near-optimal affine formation control issue of MRs with unknown dynamics as well as improve control performance. The unknown dynamics of MRs are learned offline based on precollected input-output datasets, and the sparse kernel-based approach is employed to increase the feature representation capability of the samples. Then, the proposed distributed online actor-critic algorithm for each robot in the formation includes two neural networks, which are utilized to approximate the costate functions and the near-optimal policies. Moreover, the convergence analysis of the proposed approach has been conducted. Finally, numerical simulation and KKSwarm-based experiment studies are performed to verify the effectiveness of the proposed approach.
引用
收藏
页码:803 / 817
页数:15
相关论文
共 50 条
  • [41] Optimal synchronized control of nonlinear coupled harmonic oscillators based on actor-critic reinforcement learning
    Gu, Zhiyang
    Fan, Chengli
    Yu, Dengxiu
    Wang, Zhen
    NONLINEAR DYNAMICS, 2023, 111 (22) : 21051 - 21064
  • [42] Multiagent Soft Actor-Critic Learning for Distributed ESS Enabled Robust Voltage Regulation of Active Distribution Grids
    Chen, Yongdong
    Liu, Youbo
    Yin, Hang
    Tang, Zhiyuan
    Qiu, Gao
    Liu, Junyong
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (09) : 11069 - 11080
  • [43] Neuromuscular control of sagittal arm during repetitive movement by Actor-Critic reinforcement learning method
    Golkhou, V
    Lucas, C
    Parnianpour, M
    Intelligent Automations and Control: Trends Principles, and Applications, Vol 16, 2004, 16 : 371 - 376
  • [44] Scalable Online Disease Diagnosis via Multi-Model-Fused Actor-Critic Reinforcement Learning
    He, Weijie
    Chen, Ting
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4695 - 4703
  • [45] Advancements in UAV Path Planning: A Deep Reinforcement Learning Approach with Soft Actor-Critic for Enhanced Navigation
    Guo, Jingrui
    Zhou, Guanzhong
    Huang, Hailong
    Huang, Chao
    UNMANNED SYSTEMS, 2024,
  • [46] REINFORCEMENT LEARNING-BASED ADAPTIVE MOTION CONTROL FOR AUTONOMOUS VEHICLES VIA ACTOR-CRITIC STRUCTURE
    Wang, Honghai
    Wei, Liangfen
    Wang, Xianchao
    He, Shuping
    DISCRETE AND CONTINUOUS DYNAMICAL SYSTEMS-SERIES S, 2024, 17 (09): : 2894 - 2911
  • [47] Entropy regularized actor-critic based multi-agent deep reinforcement learning for stochastic games
    Hao, Dong
    Zhang, Dongcheng
    Shi, Qi
    Li, Kai
    Information Sciences, 2022, 617 : 17 - 40
  • [48] Slice admission control in 5G wireless communication with multi-dimensional state space and distributed action space: A sequential twin actor-critic approach
    Ojijo, Mourice Otieno
    Ramotsoela, Daniel
    Oginga, Ruth A.
    COMPUTER NETWORKS, 2024, 255
  • [49] Formation control of multi-robots for on-orbit assembly of large solar sails
    Hu, Quan
    Zhang, Yao
    Zhang, Jingrui
    Hu, Haiyan
    ACTA ASTRONAUTICA, 2016, 123 : 446 - 454
  • [50] Optimized distributed formation control using identifier-critic-actor reinforcement learning for a class of stochastic nonlinear multi-agent systems☆
    Wen, Guoxing
    Niu, Ben
    ISA TRANSACTIONS, 2024, 155 : 1 - 10