Cooperative reinforcement learning in topology-based multi-agent systems

被引:8
作者
Xiao, Dan [1 ]
Tan, Ah-Hwee [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
关键词
Topology-based multi-agent systems; Cooperative learning; Reinforcement learning; Binary tree formation; Policy sharing; SUPPLY CHAIN; ALGORITHM; ARCHITECTURE;
D O I
10.1007/s10458-011-9183-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Topology-based multi-agent systems (TMAS), wherein agents interact with one another according to their spatial relationship in a network, are well suited for problems with topological constraints. In a TMAS system, however, each agent may have a different state space, which can be rather large. Consequently, traditional approaches to multi-agent cooperative learning may not be able to scale up with the complexity of the network topology. In this paper, we propose a cooperative learning strategy, under which autonomous agents are assembled in a binary tree formation (BTF). By constraining the interaction between agents, we effectively unify the state space of individual agents and enable policy sharing across agents. Our complexity analysis indicates that multi-agent systems with the BTF have a much smaller state space and a higher level of flexibility, compared with the general form of n-ary (n > 2) tree formation. We have applied the proposed cooperative learning strategy to a class of reinforcement learning agents known as temporal difference-fusion architecture for learning and cognition (TD-FALCON). Comparative experiments based on a generic network routing problem, which is a typical TMAS domain, show that the TD-FALCON BTF teams outperform alternative methods, including TD-FALCON teams in single agent and n-ary tree formation, a Q-learning method based on the table lookup mechanism, as well as a classical linear programming algorithm. Our study further shows that TD-FALCON BTF can adapt and function well under various scales of network complexity and traffic volume in TMAS domains.
引用
收藏
页码:86 / 119
页数:34
相关论文
共 50 条
  • [31] Cooperative Multi-Agent Deep Reinforcement Learning with Counterfactual Reward
    Shao, Kun
    Zhu, Yuanheng
    Tang, Zhentao
    Zhao, Dongbin
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [32] Knowledge Reuse of Multi-Agent Reinforcement Learning in Cooperative Tasks
    Shi, Daming
    Tong, Junbo
    Liu, Yi
    Fan, Wenhui
    [J]. ENTROPY, 2022, 24 (04)
  • [33] WRFMR: A Multi-Agent Reinforcement Learning Method for Cooperative Tasks
    Liu, Hui
    Zhang, Zhen
    Wang, Dongqing
    [J]. IEEE ACCESS, 2020, 8 : 216320 - 216331
  • [34] Social Interaction of Cooperative Communication and Group Generation in Multi-Agent Reinforcement Learning Systems
    Zhang, Kun
    Maeda, Yoichiro
    Takahashi, Yasutake
    [J]. IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 1350 - 1355
  • [35] Cooperative Multi-agent Reinforcement Learning Models (CMRLM) for Intelligent Traffic Control
    Vidhate, Deepak A.
    Kulkarni, Parag
    [J]. 2017 1ST INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND INFORMATION MANAGEMENT (ICISIM), 2017, : 325 - 331
  • [36] Barrier function based safe reinforcement learning for multi-agent systems
    Yao, Ying
    Zhang, Dianfeng
    Wu, Zhaojing
    Shao, Guangru
    [J]. 2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 1714 - 1721
  • [37] A Data Enhancement Strategy for Multi-Agent Cooperative Hunting based on Deep Reinforcement Learning
    Gao, Zhenkun
    Dai, Xiaoyan
    Yao, Meibao
    Xiao, Xueming
    [J]. 2023 IEEE 6TH INTERNATIONAL CONFERENCE ON INDUSTRIAL CYBER-PHYSICAL SYSTEMS, ICPS, 2023,
  • [38] Towards reinforcement learning for holonic multi-agent systems
    Abdoos, Monireh
    Mozayani, Nasser
    Bazzan, Ana L. C.
    [J]. INTELLIGENT DATA ANALYSIS, 2015, 19 (02) : 211 - 232
  • [39] Shaping multi-agent systems with gradient reinforcement learning
    Olivier Buffet
    Alain Dutech
    François Charpillet
    [J]. Autonomous Agents and Multi-Agent Systems, 2007, 15 : 197 - 220
  • [40] A cooperative jamming decision-making method based on multi-agent reinforcement learning
    Bingchen Cai
    Haoran Li
    Naimin Zhang
    Mingyu Cao
    Han Yu
    [J]. Autonomous Intelligent Systems, 5 (1):