Two-stage reward allocation with decay for multi-agent coordinated behavior for sequential cooperative task by using deep reinforcement learning

被引:0
|
作者
Miyashita Y. [1 ,2 ]
Sugawara T. [1 ]
机构
[1] Department of Computer Science and Communications Engineering, Waseda University, Tokyo
[2] Shimizu Corporation, Tokyo
来源
Autonomous Intelligent Systems | / 2卷 / 1期
基金
日本学术振兴会;
关键词
Cooperation; Coordination; Divisional cooperation; Multi-agent deep reinforcement learning;
D O I
10.1007/s43684-022-00029-z
中图分类号
学科分类号
摘要
We propose a two-stage reward allocation method with decay using an extension of replay memory to adapt this rewarding method for deep reinforcement learning (DRL), to generate coordinated behaviors for tasks that can be completed by executing a few subtasks sequentially by heterogeneous agents. An independent learner in cooperative multi-agent systems needs to learn its policies for effective execution of its own responsible subtask, as well as for coordinated behaviors under a certain coordination structure. Although the reward scheme is an issue for DRL, it is difficult to design it to learn both policies. Our proposed method attempts to generate these different behaviors in multi-agent DRL by dividing the timing of rewards into two stages and varying the ratio between them over time. By introducing the coordinated delivery and execution problem with an expiration time, where a task can be executed sequentially by two heterogeneous agents, we experimentally analyze the effect of using various ratios of the reward division in the two-stage allocations on the generated behaviors. The results demonstrate that the proposed method could improve the overall performance relative to those with the conventional one-time or fixed reward and can establish robust coordinated behavior. © 2022, The Author(s).
引用
收藏
相关论文
共 50 条
  • [21] RESOURCE ALLOCATION OPTIMIZATION FOR EFFECTIVE VEHICLE NETWORK COMMUNICATIONS USING MULTI-AGENT DEEP REINFORCEMENT LEARNING
    Ergun, Serap
    JOURNAL OF DYNAMICS AND GAMES, 2025, 12 (02): : 134 - 156
  • [22] Multi-agent deep reinforcement learning for task offloading in group distributed manufacturing systems
    Xiong, Jianyu
    Guo, Peng
    Wang, Yi
    Meng, Xiangyin
    Zhang, Jian
    Qian, Linmao
    Yu, Zhenglin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 118
  • [23] Parameter Sharing Empowered Multi-agent Deep Reinforcement Learning for Coordinated Management of Energy Communities
    Ye Y.
    Yuan Q.
    Liu W.
    Tang Y.
    Goran S.
    Zhongguo Dianji Gongcheng Xuebao/Proceedings of the Chinese Society of Electrical Engineering, 2022, 42 (21): : 7682 - 7694
  • [24] MAT-DQN: Toward Interpretable Multi-agent Deep Reinforcement Learning for Coordinated Activities
    Motokawa, Yoshinari
    Sugawara, Toshiharu
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 556 - 567
  • [25] Coordinated Variable Speed Limit Control for Freeway Based on Multi-Agent Deep Reinforcement Learning
    Yu, Rongjie
    Xu, Ling
    Zhang, Ruici
    Tongji Daxue Xuebao/Journal of Tongji University, 2024, 52 (07): : 1089 - 1098
  • [26] Multi-Agent Deep Reinforcement Learning for Trajectory Design and Power Allocation in Multi-UAV Networks
    Zhao, Nan
    Liu, Zehua
    Cheng, Yiqiang
    IEEE ACCESS, 2020, 8 : 139670 - 139679
  • [27] Collaborative Task Offloading Optimization for Satellite Mobile Edge Computing Using Multi-Agent Deep Reinforcement Learning
    Zhang, Hangyu
    Zhao, Hongbo
    Liu, Rongke
    Kaushik, Aryan
    Gao, Xiangqiang
    Xu, Shenzhan
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (10) : 15483 - 15498
  • [28] Energy Harvesting Design for Cooperative Reconfigurable Intelligent Surface with Multi-Agent Deep Reinforcement Learning
    Tao, Yihang
    Wu, Jun
    Pan, Qianqian
    Chen, Xiuzhen
    PROCEEDINGS OF THE 2024 IEEE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT DATA AND SECURITY, IDS 2024, 2024, : 42 - 46
  • [29] Cooperative Sensing and Heterogeneous Information Fusion in VCPS: A Multi-Agent Deep Reinforcement Learning Approach
    Xu, Xincao
    Liu, Kai
    Dai, Penglin
    Xie, Ruitao
    Cao, Jingjing
    Luo, Jiangtao
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (06) : 4876 - 4891
  • [30] Power Allocation for Millimeter-Wave Railway Systems with Multi-Agent Deep Reinforcement Learning
    Xu, Jianpeng
    Ai, Bo
    Sun, Yannan
    Chen, Yali
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,