Accelerating decentralized reinforcement learning of complex individual behaviors

Cited by: 2
Authors
Leottau, David L. [1]
Lobos-Tsunekawa, Kenzo [1]
Jaramillo, Francisco [1]
Ruiz-del-Solar, Javier [1]
Affiliations
[1] Univ Chile, Adv Min Technol Ctr, Dept Elect Engn, Ave Tupper 2007, Santiago, Chile
Keywords
Decentralized reinforcement learning; Multi-agent systems; Distributed control; Autonomous robots; Knowledge transfer; Distributed artificial intelligence;
DOI
10.1016/j.engappai.2019.06.019
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Many real-world Reinforcement Learning (RL) applications have multi-dimensional action spaces that suffer from a combinatorial explosion of complexity. Implementing Centralized RL (CRL) systems may then become infeasible because of the exponential growth in the dimensionality of both the state space and the action space, and because of the large number of training trials required. This paper addresses these issues by using Decentralized Reinforcement Learning (DRL) to alleviate the effects of the curse of dimensionality on the action space, and by transferring knowledge to reduce the number of training episodes needed to reach asymptotic convergence. Three DRL schemes are compared: DRL with independent learners and no prior coordination (DRL-Ind); DRL accelerated and coordinated by the Control Sharing knowledge transfer approach (DRL+CoSh); and a proposed DRL scheme that uses Nearby Action Sharing, a CoSh-based variant that includes a measure of uncertainty in the CoSh procedure (DRL+NeASh). These three schemes are analyzed through an extensive experimental study and validated on two complex real-world problems, namely the inwalk-kicking and the ball-dribbling behaviors, both performed with humanoid biped robots. The results show empirically that: (i) DRL systems are effective and, even without prior coordination, achieve asymptotic convergence through indirect coordination; (ii) the proposed knowledge transfer methods make it possible to reduce the number of training episodes and to coordinate the DRL process; and (iii) the resulting learning times are between 36% and 62% shorter than those of the DRL-Ind scheme in the case studies.
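The dimensionality argument in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the class `IndependentQLearner`, the helper functions, the action sizes `[5, 5, 5]`, the state labels, and all hyperparameters are hypothetical, chosen only to show one tabular learner per action dimension with no prior coordination. The CoSh and NeASh knowledge transfer steps are not modeled here.

```python
import math
import random
from collections import defaultdict


def joint_action_count(sizes):
    """Actions a single centralized learner must evaluate (product of sizes)."""
    return math.prod(sizes)


def decentralized_action_count(sizes):
    """Total actions across one independent learner per dimension (sum of sizes)."""
    return sum(sizes)


class IndependentQLearner:
    """Tabular Q-learner for ONE action dimension (hypothetical illustration)."""

    def __init__(self, n_actions, alpha=0.1, gamma=0.99, epsilon=0.1, seed=0):
        self.n_actions = n_actions
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        # Q-values default to zero for states not seen before.
        self.q = defaultdict(lambda: [0.0] * n_actions)
        self.rng = random.Random(seed)

    def act(self, state):
        # Epsilon-greedy selection over this learner's own dimension only.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_actions)
        values = self.q[state]
        return max(range(self.n_actions), key=values.__getitem__)

    def update(self, state, action, reward, next_state):
        # Standard one-step Q-learning update.
        target = reward + self.gamma * max(self.q[next_state])
        self.q[state][action] += self.alpha * (target - self.q[state][action])


# Assumed example: three action dimensions with five discrete values each.
sizes = [5, 5, 5]
learners = [IndependentQLearner(n, seed=i) for i, n in enumerate(sizes)]

# Each learner picks its own component; together they form the joint action.
joint_action = tuple(learner.act("s0") for learner in learners)

# All learners observe the same state and reward but update separately.
for learner, action in zip(learners, joint_action):
    learner.update("s0", action, reward=1.0, next_state="s1")
```

With these assumed sizes, a centralized learner would face 125 joint actions, whereas the three independent learners handle 15 actions in total. The learners still interact through the shared environment, which is the setting in which the abstract reports indirect coordination.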
Pages: 243 - 253
Number of pages: 11
Related papers
50 in total
  • [21] Finite-Sample Analysis for Decentralized Batch Multiagent Reinforcement Learning With Networked Agents
    Zhang, Kaiqing
    Yang, Zhuoran
    Liu, Han
    Zhang, Tong
    Basar, Tamer
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (12) : 5925 - 5940
  • [22] Decentralized Multi Agent Deep Reinforcement Q-Learning for Intelligent Traffic Controller
    Thamilselvam, B.
    Kalyanasundaram, Subrahmanyam
    Rao, M. V. Panduranga
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2023, PT I, 2023, 675 : 45 - 56
  • [23] Decentralized multi-agent based energy management of microgrid using reinforcement learning
    Samadi, Esmat
    Badri, Ali
    Ebrahimpour, Reza
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2020, 122
  • [24] A Decentralized Communication Framework Based on Dual-Level Recurrence for Multiagent Reinforcement Learning
    Li, Xuesi
    Li, Jingchen
    Shi, Haobin
    Hwang, Kao-Shing
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (02) : 640 - 649
  • [25] Accelerating the Computation of Solutions in Resource Allocation Problems Using an Evolutionary Approach and Multiagent Reinforcement Learning
    Bazzan, Ana L. C.
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2018, 2018, 10784 : 185 - 201
  • [26] Decentralized Computation Offloading with Cooperative UAVs: Multi-Agent Deep Reinforcement Learning Perspective
    Hwang, Sangwon
    Lee, Hoon
    Park, Juseong
    Lee, Inkyu
    IEEE WIRELESS COMMUNICATIONS, 2022, 29 (04) : 24 - 31
  • [27] Large-Scale Traffic Grid Signal Control Using Decentralized Fuzzy Reinforcement Learning
    Tan, Tian
    Chu, TianShu
    Peng, Bo
    Wang, Jie
    PROCEEDINGS OF SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS) 2016, VOL 1, 2018, 15 : 652 - 662
  • [28] Toward Real-Time Decentralized Reinforcement Learning Using Finite Support Basis Functions
    Lobos-Tsunekawa, Kenzo
    Leottau, David L.
    Ruiz-del-Solar, Javier
    ROBOCUP 2017: ROBOT WORLD CUP XXI, 2018, 11175 : 95 - 107
  • [29] Decentralized, Safe, Multiagent Motion Planning for Drones Under Uncertainty via Filtered Reinforcement Learning
    Vinod, Abraham P.
    Safaoui, Sleiman
    Summers, Tyler H.
    Yoshikawa, Nobuyuki
    Di Cairano, Stefano
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2024, 32 (06) : 2492 - 2499
  • [30] Decentralized collaborative optimal scheduling for EV charging stations based on multi-agent reinforcement learning
    Li, Hang
    Han, Bei
    Li, Guojie
    Wang, Keyou
    Xu, Jin
    Khan, Muhammad Waseem
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2024, 18 (06) : 1172 - 1183