Accelerating decentralized reinforcement learning of complex individual behaviors

Cited by: 2
Authors
Leottau, David L. [1]
Lobos-Tsunekawa, Kenzo [1]
Jaramillo, Francisco [1]
Ruiz-del-Solar, Javier [1]
Affiliations
[1] Univ Chile, Adv Min Technol Ctr, Dept Elect Engn, Ave Tupper 2007, Santiago, Chile
Keywords
Decentralized reinforcement learning; Multi-agent systems; Distributed control; Autonomous robots; Knowledge transfer; Distributed artificial intelligence
DOI
10.1016/j.engappai.2019.06.019
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Many real-world Reinforcement Learning (RL) applications have multi-dimensional action spaces, which suffer from a combinatorial explosion of complexity. Implementing Centralized RL (CRL) systems may therefore become infeasible because of the exponential growth in dimensionality of both the state space and the action space, and because of the large number of training trials required. This paper proposes to address these issues by using Decentralized Reinforcement Learning (DRL) to alleviate the effects of the curse of dimensionality on the action space, and by transferring knowledge to reduce the number of training episodes required to reach asymptotic convergence. Three DRL schemes are compared: DRL with independent learners and no prior coordination (DRL-Ind); DRL accelerated and coordinated by using the Control Sharing Knowledge Transfer approach (DRL+CoSh); and a proposed DRL scheme that uses Nearby Action Sharing, a CoSh-based variant, to include a measure of uncertainty in the CoSh procedure (DRL+NeASh). These three schemes are analyzed through an extensive experimental study and validated on two complex real-world problems, namely the inwalk-kicking and the ball-dribbling behaviors, both performed with humanoid biped robots. The results show empirically: (i) that DRL systems are effective and, even without prior coordination, are able to achieve asymptotic convergence through indirect coordination; (ii) that the proposed knowledge transfer methods make it possible to reduce the number of training episodes and to coordinate the DRL process; and (iii) that learning is between 36% and 62% faster than with the DRL-Ind scheme in the case studies.
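The dimensionality argument in the abstract can be illustrated with a simple count. The following is a minimal sketch and not the authors' implementation: the function names are hypothetical, and the three action dimensions with ten discrete levels each are assumed values chosen only for illustration. It compares the number of joint actions a single centralized learner would have to represent with the total number of actions handled by independent learners, one per action dimension.

```python
from math import prod


def centralized_action_count(actions_per_dim):
    """Joint actions a single centralized learner must represent (product)."""
    return prod(actions_per_dim)


def decentralized_action_count(actions_per_dim):
    """Total actions across independent learners, one per dimension (sum)."""
    return sum(actions_per_dim)


# Assumed discretization: three action dimensions with 10 levels each.
dims = [10, 10, 10]
print(centralized_action_count(dims))    # 1000
print(decentralized_action_count(dims))  # 30
```

Under these assumptions the centralized count grows multiplicatively with each added action dimension, while the decentralized count grows additively.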
Pages: 243-253
Number of pages: 11