An approach to the pursuit problem on a heterogeneous multiagent system using reinforcement learning

被引:50
|
作者
Ishiwaka, Y
Sato, T
Kakazu, Y
机构
[1] Hakodate Natl Coll Technol, Hakodate, Hokkaido, Japan
[2] Future Univ Hakodate, Hakodate, Hokkaido, Japan
[3] Hokkaido Univ, Sapporo, Hokkaido, Japan
关键词
pursuit problem; prediction; Q-learning; emergence; heterogeneous multiagent system;
D O I
10.1016/S0921-8890(03)00040-X
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cooperation among agents is important for multiagent systems having a shared goal. In this paper, an example of the pursuit problem is studied, in which four hunters collaborate to catch a target. A reinforcement learning algorithm is employed to model how the hunters acquire this cooperative behavior to achieve the task. In order to apply Q-learning, which is one way of reinforcement learning, two kinds of prediction are needed for each hunter agent. One is the location of the other hunter agents and target agent, and the other is the movement direction of the target agent at next time step t. In our treatment we extend the standard problem to systems with heterogeneous agents. One motivation for this is that the target agent and hunter agents have differing abilities. In addition, even though those hunter agents are homogeneous at the beginning of the problem, their abilities become heterogeneous in the learning process. Simulations of this pursuit problem were performed on a continuous action state space, the results of which are displayed, accompanied by a discussion of their outcomes' dependence upon the initial locations of the hunters and the speeds of the hunters and a target. (C) 2003 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:245 / 256
页数:12
相关论文
共 50 条
  • [21] An Improved Reinforcement Learning System Using Affective Factors
    Kuremoto, Takashi
    Tsurusaki, Tetsuya
    Kobayashi, Kunikazu
    Mabu, Shingo
    Obayashi, Masanao
    ROBOTICS, 2013, 2 (03): : 149 - 164
  • [22] Adaptive Individual Q-Learning-A Multiagent Reinforcement Learning Method for Coordination Optimization
    Zhang, Zhen
    Wang, Dongqing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 12
  • [23] Multiagent-Based Reinforcement Learning for Optimal Reactive Power Dispatch
    Xu, Yinliang
    Zhang, Wei
    Liu, Wenxin
    Ferrese, Frank
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06): : 1742 - 1751
  • [24] Reinforcement Learning Approach to Feedback Stabilization Problem of Probabilitic Boolcan Control Networks
    Acernese, Antonio
    Yerudkar, Amol
    Glielmo, Luigi
    Del Vecchio, Carmen Del
    IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (01): : 337 - 342
  • [25] FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks
    Zhang, Zhen
    Zhao, Dongbin
    Gao, Junwei
    Wang, Dongqing
    Dai, Yujie
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (06) : 1367 - 1379
  • [27] VGN: Value Decomposition With Graph Attention Networks for Multiagent Reinforcement Learning
    Wei, Qinglai
    Li, Yugu
    Zhang, Jie
    Wang, Fei-Yue
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 182 - 195
  • [28] A reinforcement learning approach to dairy farm battery management using Q learning
    Ali, Nawazish
    Wahid, Abdul
    Shaw, Rachael
    Mason, Karl
    JOURNAL OF ENERGY STORAGE, 2024, 93
  • [29] A Forecasting Approach to Cryptocurrency Price Index Using Reinforcement Learning
    Mariappan, L. Thanga
    Pandian, J. Arun
    Kumar, V. Dhilip
    Geman, Oana
    Chiuchisan, Iuliana
    Nastase, Carmen
    APPLIED SCIENCES-BASEL, 2023, 13 (04):
  • [30] A reinforcement learning based computational intelligence approach for binary optimization problems: The case of the set-union knapsack problem
    Ozsoydan, Fehmi Burcin
    Golcuk, Ilker
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 118