An approach to the pursuit problem on a heterogeneous multiagent system using reinforcement learning

被引:50
|
作者
Ishiwaka, Y
Sato, T
Kakazu, Y
机构
[1] Hakodate Natl Coll Technol, Hakodate, Hokkaido, Japan
[2] Future Univ Hakodate, Hakodate, Hokkaido, Japan
[3] Hokkaido Univ, Sapporo, Hokkaido, Japan
关键词
pursuit problem; prediction; Q-learning; emergence; heterogeneous multiagent system;
D O I
10.1016/S0921-8890(03)00040-X
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cooperation among agents is important for multiagent systems having a shared goal. In this paper, an example of the pursuit problem is studied, in which four hunters collaborate to catch a target. A reinforcement learning algorithm is employed to model how the hunters acquire this cooperative behavior to achieve the task. In order to apply Q-learning, which is one way of reinforcement learning, two kinds of prediction are needed for each hunter agent. One is the location of the other hunter agents and target agent, and the other is the movement direction of the target agent at next time step t. In our treatment we extend the standard problem to systems with heterogeneous agents. One motivation for this is that the target agent and hunter agents have differing abilities. In addition, even though those hunter agents are homogeneous at the beginning of the problem, their abilities become heterogeneous in the learning process. Simulations of this pursuit problem were performed on a continuous action state space, the results of which are displayed, accompanied by a discussion of their outcomes' dependence upon the initial locations of the hunters and the speeds of the hunters and a target. (C) 2003 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:245 / 256
页数:12
相关论文
共 50 条
  • [41] A,Multiagent approach to Q-learning for daily stock trading
    Lee, Jae Won
    Park, Jonghun
    O, Jangmin
    Lee, Jongwoo
    Hong, Euyseok
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2007, 37 (06): : 864 - 877
  • [42] Hierarchical Multi-Robot Pursuit with Deep Reinforcement Learning and Navigation Planning
    Chen, Wenzhang
    Zhu, Yuanheng
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 1274 - 1280
  • [43] Design of an Adaptive e-Learning System based on Multi-Agent Approach and Reinforcement Learning
    El Fazazi, Hanaa
    Elgarej, Mouhcine
    Qbadou, Mohamed
    Mansouri, Khalifa
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2021, 11 (01) : 6637 - 6644
  • [44] Promoting the Emergence of Behavior Norms in a Principal-Agent Problem-An Agent-Based Modeling Approach Using Reinforcement Learning
    Harati, Saeed
    Perez, Liliana
    Molowny-Horas, Roberto
    APPLIED SCIENCES-BASEL, 2021, 11 (18):
  • [45] Monopoly Using Reinforcement Learning
    Arun, Edupuganti
    Rajesh, Harikrishna
    Chakrabarti, Debarka
    Cherala, Harikiran
    George, Koshy
    PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 864 - 868
  • [46] Real-time scheduling for a smart factory using a reinforcement learning approach
    Shiue, Yeou-Ren
    Lee, Ken-Chuan
    Su, Chao-Ton
    COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 125 : 604 - 614
  • [47] Desertification Control Strategies: A Hybrid Approach Using Cellular Automata and Reinforcement Learning
    Mouakher, Amira
    Kone, Alassane
    Fontaine, Allyx
    El Yacoubi, Samira
    CELLULAR AUTOMATA, ACRI 2024, 2024, 14978 : 203 - 216
  • [48] Modelling building HVAC control strategies using a deep reinforcement learning approach
    Nguyen, Anh Tuan
    Pham, Duy Hoang
    Oo, Bee Lan
    Santamouris, Mattheos
    Ahn, Yonghan
    Lim, Benson T. H.
    ENERGY AND BUILDINGS, 2024, 310
  • [49] A practical heterogeneous network optimisation algorithm based on reinforcement learning
    Feng, Zhiyong
    Tan, Li
    Li, Wei
    Gulliver, T. Aaron
    Liang, Litao
    INTERNATIONAL JOURNAL OF COMMUNICATION NETWORKS AND DISTRIBUTED SYSTEMS, 2011, 6 (04) : 357 - 372
  • [50] Towards an Adaptive e-Learning System Based on Deep Learner Profile, Machine Learning Approach, and Reinforcement Learning
    Mustapha, Riad
    Soukaina, Gouraguine
    Mohammed, Qbadou
    Es-Saadia, Aoula
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (05) : 265 - 274