Two-stage reward allocation with decay for multi-agent coordinated behavior for sequential cooperative task by using deep reinforcement learning

被引：0

作者：

Miyashita Y. ^{[1
,2
]}

Sugawara T. ^{[1
]}

机构：

[1] Department of Computer Science and Communications Engineering, Waseda University, Tokyo

[2] Shimizu Corporation, Tokyo

来源：

Autonomous Intelligent Systems | / 2卷 / 1期

基金：

日本学术振兴会;

关键词：

Cooperation; Coordination; Divisional cooperation; Multi-agent deep reinforcement learning;

D O I：

10.1007/s43684-022-00029-z

中图分类号：

学科分类号：

摘要：

We propose a two-stage reward allocation method with decay using an extension of replay memory to adapt this rewarding method for deep reinforcement learning (DRL), to generate coordinated behaviors for tasks that can be completed by executing a few subtasks sequentially by heterogeneous agents. An independent learner in cooperative multi-agent systems needs to learn its policies for effective execution of its own responsible subtask, as well as for coordinated behaviors under a certain coordination structure. Although the reward scheme is an issue for DRL, it is difficult to design it to learn both policies. Our proposed method attempts to generate these different behaviors in multi-agent DRL by dividing the timing of rewards into two stages and varying the ratio between them over time. By introducing the coordinated delivery and execution problem with an expiration time, where a task can be executed sequentially by two heterogeneous agents, we experimentally analyze the effect of using various ratios of the reward division in the two-stage allocations on the generated behaviors. The results demonstrate that the proposed method could improve the overall performance relative to those with the conventional one-time or fixed reward and can establish robust coordinated behavior. © 2022, The Author(s).

引用

共 50 条

[21] RESOURCE ALLOCATION OPTIMIZATION FOR EFFECTIVE VEHICLE NETWORK COMMUNICATIONS USING MULTI-AGENT DEEP REINFORCEMENT LEARNING
Ergun, Serap
JOURNAL OF DYNAMICS AND GAMES, 2025, 12 (02): : 134 - 156
[22] Multi-agent deep reinforcement learning for task offloading in group distributed manufacturing systems
Xiong, Jianyu
Guo, Peng
Wang, Yi
Meng, Xiangyin
Zhang, Jian
Qian, Linmao
Yu, Zhenglin
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 118
[23] Parameter Sharing Empowered Multi-agent Deep Reinforcement Learning for Coordinated Management of Energy Communities
Ye Y.
Yuan Q.
Liu W.
Tang Y.
Goran S.
Zhongguo Dianji Gongcheng Xuebao/Proceedings of the Chinese Society of Electrical Engineering, 2022, 42 (21): : 7682 - 7694
[24] MAT-DQN: Toward Interpretable Multi-agent Deep Reinforcement Learning for Coordinated Activities
Motokawa, Yoshinari
Sugawara, Toshiharu
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 556 - 567
[25] Coordinated Variable Speed Limit Control for Freeway Based on Multi-Agent Deep Reinforcement Learning
Yu, Rongjie
Xu, Ling
Zhang, Ruici
Tongji Daxue Xuebao/Journal of Tongji University, 2024, 52 (07): : 1089 - 1098
[26] Multi-Agent Deep Reinforcement Learning for Trajectory Design and Power Allocation in Multi-UAV Networks
Zhao, Nan
Liu, Zehua
Cheng, Yiqiang
IEEE ACCESS, 2020, 8 : 139670 - 139679
[27] Collaborative Task Offloading Optimization for Satellite Mobile Edge Computing Using Multi-Agent Deep Reinforcement Learning
Zhang, Hangyu
Zhao, Hongbo
Liu, Rongke
Kaushik, Aryan
Gao, Xiangqiang
Xu, Shenzhan
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (10) : 15483 - 15498
[28] Energy Harvesting Design for Cooperative Reconfigurable Intelligent Surface with Multi-Agent Deep Reinforcement Learning
Tao, Yihang
Wu, Jun
Pan, Qianqian
Chen, Xiuzhen
PROCEEDINGS OF THE 2024 IEEE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT DATA AND SECURITY, IDS 2024, 2024, : 42 - 46
[29] Cooperative Sensing and Heterogeneous Information Fusion in VCPS: A Multi-Agent Deep Reinforcement Learning Approach
Xu, Xincao
Liu, Kai
Dai, Penglin
Xie, Ruitao
Cao, Jingjing
Luo, Jiangtao
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (06) : 4876 - 4891
[30] Power Allocation for Millimeter-Wave Railway Systems with Multi-Agent Deep Reinforcement Learning
Xu, Jianpeng
Ai, Bo
Sun, Yannan
Chen, Yali
2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,

← 1 2 3 4 5 →