Optimal tracking agent: a new framework of reinforcement learning for multiagent systems

被引:3
|
作者
Cao, Weihua [1 ]
Chen, Gang [1 ]
Chen, Xin [1 ]
Wu, Min [1 ]
机构
[1] Cent South Univ, Inst Adv Control & Intelligent Automat, Sch Informat Sci & Engn, Changsha 410083, Hunan, Peoples R China
基金
高等学校博士学科点专项科研基金;
关键词
estimator; action selection mechanism; curse of dimensionality; optimal tracking agent; multiagent systems;
D O I
10.1002/cpe.2870
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
SUMMARYThe curse of dimensionality is a ubiquitous problem for multiagent reinforcement learning, which means the learning and storing space grows exponentially with the number of agents and hinders the application of multiagent reinforcement learning. To relieve this problem, we propose a new framework named as optimal tracking agent (OTA). The OTA views the other agents as part of the environment and uses a reduced form to learn the optimal decision. Although merging other agents into the environment may reduce the dimension of action space, the environment characterized by such form is dynamic and does not satisfy the convergence of reinforcement learning (RL). Thus, we develop an estimator to track the dynamics of the environment. The estimator obtains the dynamic model, and then the model-based RL can be used to react to the dynamic environment optimally. Because the Q-function in OTA is also a dynamic process because of other agents' dynamics, different from traditional RL, in which the learning is a stationary process and the usual action selection mechanisms just suit to such stationary process, we improve the greedy action selection mechanism to adapt to such dynamics. Thus, the OTA will have convergence. An experiment illustrates the validity and efficiency of the OTA.Copyright (c) 2012 John Wiley & Sons, Ltd.
引用
收藏
页码:2002 / 2015
页数:14
相关论文
共 50 条
  • [41] Reinforcement Learning for Synchronization of Heterogeneous Multiagent Systems by Improved Q-Functions
    Li, Jinna
    Yuan, Lin
    Cheng, Weiran
    Chai, Tianyou
    Lewis, Frank L.
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (11) : 6545 - 6558
  • [42] Reinforcement learning technique using agent state occurrence frequency with analysis of knowledge sharing on the agent’s learning process in multiagent environments
    H. S. Al-Dayaa
    D. B. Megherbi
    The Journal of Supercomputing, 2012, 59 : 526 - 547
  • [43] Model-Based Reinforcement Learning in Multiagent Systems with Sequential Action Selection
    Akramizadeh, Ali
    Afshar, Ahmad
    Menhaj, Mohammad Bagher
    Jafari, Samira
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (02): : 255 - 263
  • [44] A Collaborative Framework for Multiagent Systems
    Ahmed, Moamin
    Ahmad, Mohd Sharifuddin
    Yusoff, Mohd Zaliman M.
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT I, PROCEEDINGS, 2010, 5990 : 329 - 338
  • [45] Reinforcement learning technique using agent state occurrence frequency with analysis of knowledge sharing on the agent's learning process in multiagent environments
    Al-Dayaa, H. S.
    Megherbi, D. B.
    JOURNAL OF SUPERCOMPUTING, 2012, 59 (01) : 526 - 547
  • [46] Cooperative Learning of Multi-Agent Systems Via Reinforcement Learning
    Wang, Xin
    Zhao, Chen
    Huang, Tingwen
    Chakrabarti, Prasun
    Kurths, Juergen
    IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2023, 9 : 13 - 23
  • [47] pFedEff: An Efficient and Personalized Federated Cognitive Learning Framework in Multiagent Systems
    Shi, Hongjian
    Zhang, Jianqing
    Fan, Shuming
    Ma, Ruhui
    Guan, Haibing
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (01) : 31 - 45
  • [48] A unified framework for reinforcement learning, co-learning and meta-learning how to coordinate in collaborative multi-agent systems
    Tosic, Predrag T.
    Vilalta, Ricardo
    ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS, 2010, 1 (01): : 2211 - 2220
  • [49] Deep multiagent reinforcement learning: challenges and directions
    Wong, Annie
    Back, Thomas
    Kononova, Anna, V
    Plaat, Aske
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (06) : 5023 - 5056
  • [50] Cognition-Oriented Multiagent Reinforcement Learning
    Qiu, Tenghai
    Wu, Shiguang
    Liu, Zhen
    Pu, Zhiqiang
    Yi, Jianqiang
    Zhao, Yuqian
    Luo, Biao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,