Distributed reinforcement learning in multi-agent networks

被引:0
作者
Kar, Soummya [1 ]
Moura, Jose M. F. [1 ]
Poor, H. Vincent [2 ]
机构
[1] Carnegie Mellon Univ, Dept ECE, Pittsburgh, PA 15213 USA
[2] Princeton Univ, Dept EE, Princeton, NJ 08544 USA
来源
2013 IEEE 5TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2013) | 2013年
基金
美国国家科学基金会;
关键词
Multi-agent stochastic control; distributed Q-learning; reinforcement learning; collaborative network processing; consensus plus innovations; distributed stochastic approximation;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Distributed reinforcement learning algorithms for collaborative multi-agent Markov decision processes (MDPs) are presented and analyzed. The networked setup consists of a collection of agents (learners) which respond differently (depending on their instantaneous one-stage random costs) to a global controlled state and the control actions of a remote controller. With the objective of jointly learning the optimal stationary control policy (in the absence of global state transition and local agent cost statistics) that minimizes network-averaged infinite horizon discounted cost, the paper presents distributed variants of Q-learning of the consensus + innovations type in which each agent sequentially refines its learning parameters by locally processing its instantaneous payoff data and the information received from neighboring agents. Under broad conditions on the multi-agent decision model and mean connectivity of the inter-agent communication network, the proposed distributed algorithms are shown to achieve optimal learning asymptotically, i. e., almost surely (a. s.) each network agent is shown to learn the value function and the optimal stationary control policy of the collaborative MDP asymptotically. Further, convergence rate estimates for the proposed class of distributed learning algorithms are obtained.
引用
收藏
页码:296 / +
页数:2
相关论文
共 50 条
  • [1] QD-Learning: A Collaborative Distributed Strategy for Multi-Agent Reinforcement Learning Through Consensus plus Innovations
    Kar, Soummya
    Moura, Jose M. F.
    Poor, H. Vincent
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2013, 61 (07) : 1848 - 1862
  • [2] Distributed Transmission Control for Wireless Networks using Multi-Agent Reinforcement Learning
    Farquhar, Collin
    Kumar, Prem
    Jagannath, Anu
    Jagannath, Jithin
    BIG DATA IV: LEARNING, ANALYTICS, AND APPLICATIONS, 2022, 12097
  • [3] Distributed Cooperative Spectrum Sharing in UAV Networks Using Multi-Agent Reinforcement Learning
    Shamsoshoara, Alireza
    Khaledi, Mehrdad
    Afghah, Fatemeh
    Razi, Abolfazl
    Ashdown, Jonathan
    2019 16TH IEEE ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC), 2019,
  • [4] Distributed hierarchical reinforcement learning in multi-agent adversarial environments
    Naderializadeh, Navid
    Soleyman, Sean
    Hung, Fan
    Khosla, Deepak
    Chen, Yang
    Fadaie, Joshua G.
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS IV, 2022, 12113
  • [5] Distributed Traffic Engineering in Hybrid Software Defined Networks: A Multi-Agent Reinforcement Learning Framework
    Guo, Yingya
    Lin, Bin
    Tang, Qi
    Ma, Yulong
    Luo, Huan
    Tian, Han
    Chen, Kai
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (06): : 6759 - 6769
  • [6] Transform networks for cooperative multi-agent deep reinforcement learning
    Wang, Hongbin
    Xie, Xiaodong
    Zhou, Lianke
    APPLIED INTELLIGENCE, 2023, 53 (08) : 9261 - 9269
  • [7] A Survey on Multi-Agent Reinforcement Learning Methods for Vehicular Networks
    Althamary, Ibrahim
    Huang, Chih-Wei
    Lin, Phone
    2019 15TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2019, : 1154 - 1159
  • [8] Transform networks for cooperative multi-agent deep reinforcement learning
    Hongbin Wang
    Xiaodong Xie
    Lianke Zhou
    Applied Intelligence, 2023, 53 : 9261 - 9269
  • [9] Distributed cooperative reinforcement learning for multi-agent system with collision avoidance
    Lan, Xuejing
    Yan, Jiapei
    He, Shude
    Zhao, Zhijia
    Zou, Tao
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (01) : 567 - 585
  • [10] Dynamic distributed constraint optimization using multi-agent reinforcement learning
    Shokoohi, Maryam
    Afsharchi, Mohsen
    Shah-Hoseini, Hamed
    SOFT COMPUTING, 2022, 26 (08) : 3601 - 3629