Optimal tracking agent: a new framework of reinforcement learning for multiagent systems

被引:3
|
作者
Cao, Weihua [1 ]
Chen, Gang [1 ]
Chen, Xin [1 ]
Wu, Min [1 ]
机构
[1] Cent South Univ, Inst Adv Control & Intelligent Automat, Sch Informat Sci & Engn, Changsha 410083, Hunan, Peoples R China
基金
高等学校博士学科点专项科研基金;
关键词
estimator; action selection mechanism; curse of dimensionality; optimal tracking agent; multiagent systems;
D O I
10.1002/cpe.2870
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
SUMMARYThe curse of dimensionality is a ubiquitous problem for multiagent reinforcement learning, which means the learning and storing space grows exponentially with the number of agents and hinders the application of multiagent reinforcement learning. To relieve this problem, we propose a new framework named as optimal tracking agent (OTA). The OTA views the other agents as part of the environment and uses a reduced form to learn the optimal decision. Although merging other agents into the environment may reduce the dimension of action space, the environment characterized by such form is dynamic and does not satisfy the convergence of reinforcement learning (RL). Thus, we develop an estimator to track the dynamics of the environment. The estimator obtains the dynamic model, and then the model-based RL can be used to react to the dynamic environment optimally. Because the Q-function in OTA is also a dynamic process because of other agents' dynamics, different from traditional RL, in which the learning is a stationary process and the usual action selection mechanisms just suit to such stationary process, we improve the greedy action selection mechanism to adapt to such dynamics. Thus, the OTA will have convergence. An experiment illustrates the validity and efficiency of the OTA.Copyright (c) 2012 John Wiley & Sons, Ltd.
引用
收藏
页码:2002 / 2015
页数:14
相关论文
共 50 条
  • [31] Consensus Tracking of Disturbed Second-Order Multiagent Systems With Actuator Attacks: Reinforcement-Learning-Based Approach
    Liu, Huawei
    Wen, Guanghui
    Fu, Junjie
    Luo, Zhexin
    Zheng, Dezhi
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025,
  • [32] Lateral Transfer Learning for Multiagent Reinforcement Learning
    Shi, Haobin
    Li, Jingchen
    Mao, Jiahui
    Hwang, Kao-Shing
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) : 1699 - 1711
  • [33] Optimal Control for Multi-agent Systems Using Off-Policy Reinforcement Learning
    Wang, Hao
    Chen, Zhiru
    Wang, Jun
    Lu, Lijun
    Li, Mingzhe
    2022 4TH INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTICS, ICCR, 2022, : 135 - 140
  • [34] Resilient adaptive optimal control of distributed multi-agent systems using reinforcement learning
    Moghadam, Rohollah
    Modares, Hamidreza
    IET CONTROL THEORY AND APPLICATIONS, 2018, 12 (16) : 2165 - 2174
  • [35] Interaction Models for Multiagent Reinforcement Learning
    Ribeiro, Richardson
    Borges, Andre P.
    Enembreck, Fabricio
    2008 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING CONTROL & AUTOMATION, VOLS 1 AND 2, 2008, : 464 - +
  • [36] A REINFORCEMENT LEARNING APPROACH FOR MULTIAGENT NAVIGATION
    Martinez-Gil, Francisco
    Barber, Fernando
    Lozano, Miguel
    Grimaldo, Francisco
    Fernandez, Fernando
    ICAART 2010: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1: ARTIFICIAL INTELLIGENCE, 2010, : 607 - 610
  • [37] A comprehensive survey of multiagent reinforcement learning
    Busoniu, Lucian
    Babuska, Robert
    De Schutter, Bart
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02): : 156 - 172
  • [38] Tracking Algorithms for Multiagent Systems
    Meng, Deyuan
    Jia, Yingmin
    Du, Junping
    Yu, Fashan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (10) : 1660 - 1676
  • [39] MARRGM: Learning Framework for Multi-Agent Reinforcement Learning via Reinforcement Recommendation and Group Modification
    Wu, Peiliang
    Tian, Liqiang
    Zhang, Qian
    Mao, Bingyi
    Chen, Wenbai
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (06) : 5385 - 5392
  • [40] A Decentralized Communication Framework Based on Dual-Level Recurrence for Multiagent Reinforcement Learning
    Li, Xuesi
    Li, Jingchen
    Shi, Haobin
    Hwang, Kao-Shing
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (02) : 640 - 649