A Distributed Actor-Critic Learning Approach for Affine Formation Control of Multi-Robots With Unknown Dynamics

被引:1
|
作者
Zhang, Ronghua [1 ,2 ]
Ma, Qingwen [1 ]
Zhang, Xinglong [1 ]
Xu, Xin [1 ]
Liu, Daxue [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha, Peoples R China
[2] Sichuan Univ Sci & Engn, Sch Mech Engn, Zigong, Peoples R China
关键词
affine formation control; data-driven; multi-robots; reinforcement learning; rollout; TIME NONLINEAR-SYSTEMS;
D O I
10.1002/acs.3972
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Formation maneuverability is particularly important for multi-robots (MRs), especially when the robots are operating cooperatively in complex and dynamic environments. Although various methods have been developed for affine formation, it is still a difficult problem to design an affine formation controller for MRs with unknown dynamics. In this paper, a distributed actor-critic learning approach (DACL) in a look-ahead rollout manner is proposed for the affine formation of MRs under local communication, which improves the online learning efficiency. In the proposed approach, a distributed data-driven online optimization mechanism is designed via the sparse kernel technique to solve the near-optimal affine formation control issue of MRs with unknown dynamics as well as improve control performance. The unknown dynamics of MRs are learned offline based on precollected input-output datasets, and the sparse kernel-based approach is employed to increase the feature representation capability of the samples. Then, the proposed distributed online actor-critic algorithm for each robot in the formation includes two neural networks, which are utilized to approximate the costate functions and the near-optimal policies. Moreover, the convergence analysis of the proposed approach has been conducted. Finally, numerical simulation and KKSwarm-based experiment studies are performed to verify the effectiveness of the proposed approach.
引用
收藏
页码:803 / 817
页数:15
相关论文
共 50 条
  • [21] An actor-critic approach for learning cooperative behaviors of multiagent seesaw balancing problems
    Kawakami, T
    Kinoshita, M
    Takatori, N
    Watanabe, M
    Furukawa, M
    INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOL 1-4, PROCEEDINGS, 2005, : 109 - 114
  • [22] Reinforcement learning for automatic quadrilateral mesh generation: A soft actor-critic approach
    Pan, Jie
    Huang, Jingwei
    Cheng, Gengdong
    Zeng, Yong
    NEURAL NETWORKS, 2023, 157 : 288 - 304
  • [23] Reinforcement learning control for coordinated manipulation of multi-robots
    Li, Yanan
    Chen, Long
    Tee, Keng Peng
    Li, Qingquan
    NEUROCOMPUTING, 2015, 170 : 168 - 175
  • [24] An inertia wheel pendulum control method based on actor-critic learning algorithm
    Liu Huanlong
    Wang Zhengjie
    Jiang Bin
    Peng Hongyu
    2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 1281 - 1285
  • [25] Autonomous Gain Tuning for Differential Drive Robots Targeting Control using Soft Actor-Critic
    Peng, Chao-Chung
    Chiang, Meng-Huan
    Chen, Yi-Ho
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 255 - 260
  • [26] Adaptive optimal formation control for unmanned surface vehicles with guaranteed performance using actor-critic learning architecture
    Chen, Lin
    Dong, Chao
    He, Shude
    Dai, Shi-Lu
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (08) : 4504 - 4522
  • [27] Graph Soft Actor-Critic Reinforcement Learning for Large-Scale Distributed Multirobot Coordination
    Hu, Yifan
    Fu, Junjie
    Wen, Guanghui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 665 - 676
  • [28] Adaptive Reinforcement Learning Formation Control Using ORFBLS for Omnidirectional Mobile Multi-Robots
    Tsai, Ching-Chih
    Chen, Hsing-Yi
    Chen, Shih-Che
    Tai, Feng-Chun
    Chen, Guan-Ming
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2023, 25 (05) : 1756 - 1769
  • [29] A Continuous Actor-Critic Reinforcement Learning Approach to Flocking with Fixed-Wing UAVs
    Wang, Chang
    Yan, Chao
    Xiang, Xiaojia
    Zhou, Han
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 101, 2019, 101 : 64 - 79
  • [30] Heterogeneous trading strategies with adaptive fuzzy Actor-Critic reinforcement learning: A behavioral approach
    Bekiros, Stelios D.
    JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2010, 34 (06) : 1153 - 1170