Coordinated behavior of cooperative agents using deep reinforcement learning

Cited by: 11
Authors
Diallo, Elhadji Amadou Oury [1]
Sugiyama, Ayumi [1]
Sugawara, Toshiharu [1]
Affiliations
[1] Waseda Univ, Dept Comp Sci & Commun Engn, Shinjuku Ku, 3-4-1 Okubo, Tokyo 1698555, Japan
Keywords
Deep reinforcement learning; Multi-agent systems; Cooperation; Coordination; LINEAR MULTIAGENT SYSTEMS; INTELLIGENCE; TAXONOMY
DOI
10.1016/j.neucom.2018.08.094
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this work, we focus on an environment where multiple agents with complementary capabilities cooperate to generate non-conflicting joint actions that achieve a specific target. The central problem addressed is how several agents can collectively learn to coordinate their actions such that they complete a given task together without conflicts. However, sequential decision-making under uncertainty is one of the most challenging issues for intelligent cooperative systems. To address this, we propose a multi-agent concurrent framework where agents learn coordinated behaviors in order to divide their areas of responsibility. The proposed framework is an extension of some recent deep reinforcement learning algorithms such as DQN, double DQN, and dueling network architectures. Then, we investigate how the learned behaviors change according to the dynamics of the environment, reward scheme, and network structures. Next, we show how agents behave and choose their actions such that the resulting joint actions are optimal. We finally show that our method can lead to stable solutions in our specific environment. (C) 2019 Elsevier B.V. All rights reserved.
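The abstract names DQN, double DQN, and dueling network architectures as the algorithms the framework extends, but does not state their update rules. For orientation only, the Python sketch below shows the commonly cited textbook forms of the three ideas for a single agent. It is not the paper's multi-agent framework, and the function names and numeric values are illustrative assumptions, not taken from the source.

```python
from typing import List, Sequence


def dqn_target(reward: float, gamma: float,
               next_q_target: Sequence[float], done: bool) -> float:
    """DQN target: bootstrap from the target network's maximum Q-value."""
    if done:
        return reward
    return reward + gamma * max(next_q_target)


def double_dqn_target(reward: float, gamma: float,
                      next_q_online: Sequence[float],
                      next_q_target: Sequence[float], done: bool) -> float:
    """Double DQN target: the online network selects the next action,
    the target network evaluates it."""
    if done:
        return reward
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best_action]


def dueling_q_values(state_value: float,
                     advantages: Sequence[float]) -> List[float]:
    """Dueling aggregation: Q(s, a) = V(s) + A(s, a) - mean over a' of A(s, a')."""
    mean_advantage = sum(advantages) / len(advantages)
    return [state_value + a - mean_advantage for a in advantages]


# Hypothetical Q-value estimates for one next state with three actions.
next_q_online = [1.0, 3.0, 2.0]
next_q_target = [5.0, 0.5, 4.0]

y_dqn = dqn_target(1.0, 0.5, next_q_target, done=False)
y_double = double_dqn_target(1.0, 0.5, next_q_online, next_q_target, done=False)
q_dueling = dueling_q_values(2.0, [1.0, 0.0, -1.0])
```

With these made-up numbers the two targets differ because double DQN evaluates the action preferred by the online network rather than taking the target network's own maximum, which is the usual motivation given for it (reducing overestimation).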
Pages: 230-240
Page count: 11