Accelerating decentralized reinforcement learning of complex individual behaviors

被引:2
|
作者
Leottau, David L. [1 ]
Lobos-Tsunekawa, Kenzo [1 ]
Jaramillo, Francisco [1 ]
Ruiz-del-Solar, Javier [1 ]
机构
[1] Univ Chile, Adv Min Technol Ctr, Dept Elect Engn, Ave Tupper 2007, Santiago, Chile
关键词
Decentralized reinforcement learning; Multi-agent systems; Distributed control; Autonomous robots; Knowledge transfer; Distributed artificial intelligence;
D O I
10.1016/j.engappai.2019.06.019
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many Reinforcement Learning (RL) real-world applications have multi-dimensional action spaces which suffer from the combinatorial explosion of complexity. Then, it may turn infeasible to implement Centralized RL (CRL) systems due to the exponential increasing of dimensionality in both the state space and the action space, and the large number of training trials. In order to address this, this paper proposes to deal with these issues by using Decentralized Reinforcement Learning (DRL) to alleviate the effects of the curse of dimensionality on the action space, and by transferring knowledge to reduce the training episodes so that asymptotic converge can be achieved. Three DRL schemes are compared: DRL with independent learners and no prior-coordination (DRLInd); DRL accelerated-coordinated by using the Control Sharing (DRL+CoSh) Knowledge Transfer approach; and a proposed DRL scheme using the CoSh-based variant Nearby Action Sharing to include a measure of the uncertainty into the CoSh procedure (DRL+NeASh). These three schemes are analyzed through an extensive experimental study and validated through two complex real-world problems, namely the inwalk-kicking and the ball-dribbling behaviors, both performed with humanoid biped robots. Obtained results show (empirically): (i) the effectiveness of DRL systems which even without prior-coordination are able to achieve asymptotic convergence throughout indirect coordination; (ii) that by using the proposed knowledge transfer methods, it is possible to reduce the training episodes and to coordinate the DRL process; and (iii) obtained learning times are between 36% and 62% faster than the DRL-Ind schemes in the case studies.
引用
收藏
页码:243 / 253
页数:11
相关论文
共 50 条
  • [1] Decentralized Reinforcement Learning of Robot Behaviors
    Leottau, David L.
    Ruiz-del-Solar, Javier
    Babuska, Robert
    ARTIFICIAL INTELLIGENCE, 2018, 256 : 130 - 159
  • [2] Decentralized Reinforcement Learning Inspired by Multiagent Systems
    Adjodah, Dhaval
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 1729 - 1730
  • [3] Towards Decentralized Reinforcement Learning Architectures for Social Dilemmas
    Anastassacos, Nicolas
    Musolesi, Mirco
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1776 - 1777
  • [4] Adaptive Learning: A New Decentralized Reinforcement Learning Approach for Cooperative Multiagent Systems
    Li, Meng-Lin
    Chen, Shaofei
    Chen, Jing
    IEEE ACCESS, 2020, 8 : 99404 - 99421
  • [5] V-Learning-A Simple, Efficient, Decentralized Algorithm for Multiagent Reinforcement Learning
    Jin, Chi
    Liu, Qinghua
    Wang, Yuanhao
    Yu, Tiancheng
    MATHEMATICS OF OPERATIONS RESEARCH, 2024, 49 (04) : 2295 - 2322
  • [6] Model-based Reinforcement Learning for Decentralized Multiagent Rendezvous
    Wang, Rose E.
    Kew, J. Chase
    Lee, Dennis
    Lee, Tsang-Wei Edward
    Zhang, Tingnan
    Ichter, Brian
    Tan, Jie
    Faust, Aleksandra
    CONFERENCE ON ROBOT LEARNING, VOL 155, 2020, 155 : 711 - 725
  • [7] Attentive Relational State Representation in Decentralized Multiagent Reinforcement Learning
    Liu, Xiangyu
    Tan, Ying
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (01) : 252 - 264
  • [8] Aligning individual and collective welfare in complex socio-technical systems by combining metaheuristics and reinforcement learning
    Bazzan, Ana L. C.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 79 : 23 - 33
  • [9] Decentralized Incremental Fuzzy Reinforcement Learning for Multi-Agent Systems
    Hamzeloo, Sam
    Jahromi, Mansoor Zolghadri
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2020, 28 (01) : 79 - 98
  • [10] Inverse Reinforcement Learning for Decentralized Non-Cooperative Multiagent Systems
    Reddy, Tummalapalli Sudhamsh
    Gopikrishna, Vamsikrishna
    Zaruba, Gergely
    Huber, Manfred
    PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 1930 - 1935