An approach to the pursuit problem on a heterogeneous multiagent system using reinforcement learning

被引:50
|
作者
Ishiwaka, Y
Sato, T
Kakazu, Y
机构
[1] Hakodate Natl Coll Technol, Hakodate, Hokkaido, Japan
[2] Future Univ Hakodate, Hakodate, Hokkaido, Japan
[3] Hokkaido Univ, Sapporo, Hokkaido, Japan
关键词
pursuit problem; prediction; Q-learning; emergence; heterogeneous multiagent system;
D O I
10.1016/S0921-8890(03)00040-X
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cooperation among agents is important for multiagent systems having a shared goal. In this paper, an example of the pursuit problem is studied, in which four hunters collaborate to catch a target. A reinforcement learning algorithm is employed to model how the hunters acquire this cooperative behavior to achieve the task. In order to apply Q-learning, which is one way of reinforcement learning, two kinds of prediction are needed for each hunter agent. One is the location of the other hunter agents and target agent, and the other is the movement direction of the target agent at next time step t. In our treatment we extend the standard problem to systems with heterogeneous agents. One motivation for this is that the target agent and hunter agents have differing abilities. In addition, even though those hunter agents are homogeneous at the beginning of the problem, their abilities become heterogeneous in the learning process. Simulations of this pursuit problem were performed on a continuous action state space, the results of which are displayed, accompanied by a discussion of their outcomes' dependence upon the initial locations of the hunters and the speeds of the hunters and a target. (C) 2003 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:245 / 256
页数:12
相关论文
共 50 条
  • [31] Reinforcement learning with phased approach for fast learning
    Hodohara, Norifumi
    Murakami, Yuichi
    Nakamura, Shingo
    Hashimoto, Shuji
    PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 17TH '12), 2012, : 930 - 933
  • [32] State Augmentation via Self-Supervision in Offline Multiagent Reinforcement Learning
    Wang, Siying
    Li, Xiaodie
    Qu, Hong
    Chen, Wenyu
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (03) : 1051 - 1062
  • [33] Online control of stencil printing parameters using reinforcement learning approach
    Khader, Nourma
    Yoon, Sang Won
    28TH INTERNATIONAL CONFERENCE ON FLEXIBLE AUTOMATION AND INTELLIGENT MANUFACTURING (FAIM2018): GLOBAL INTEGRATION OF INTELLIGENT MANUFACTURING AND SMART INDUSTRY FOR GOOD OF HUMANITY, 2018, 17 : 94 - 101
  • [34] Improved decision making in multiagent system for diagnostic application using cooperative learning algorithms
    Vidhate D.A.
    Kulkarni P.
    International Journal of Information Technology, 2018, 10 (2) : 201 - 209
  • [35] Reinforcement Learning approaches to Economic Dispatch problem
    Jasmin, E. A.
    Ahamed, T. P. Imthias
    Raj, V. P. Jagathy
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2011, 33 (04) : 836 - 845
  • [36] A Reinforcement Learning Approach for Solving Integrated Mass Customization Process Planning and Job-Shop Scheduling Problem in a Reconfigurable Manufacturing System
    Gao, Sini
    Daaboul, Joanna
    Le Duigou, Julien
    12TH INTERNATIONAL WORKSHOP ON SERVICE ORIENTED, HOLONIC AND MULTI-AGENT MANUFACTURING SYSTEMS FOR INDUSTRY OF THE FUTURE, SOHOMA 2022, 2023, 1083 : 395 - 406
  • [37] Contracts for Difference: A Reinforcement Learning Approach
    Zengeler, Nico
    Handmann, Uwe
    JOURNAL OF RISK AND FINANCIAL MANAGEMENT, 2020, 13 (04)
  • [38] SQIX: QMIX Algorithm Activated by General Softmax Operator for Cooperative Multiagent Reinforcement Learning
    Zhang, Miaomiao
    Tong, Wei
    Zhu, Guangyu
    Xu, Xin
    Wu, Edmond Q.
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (11): : 6550 - 6560
  • [39] Distributed Actor-Critic Algorithms for Multiagent Reinforcement Learning Over Directed Graphs
    Dai, Pengcheng
    Yu, Wenwu
    Wang, He
    Baldi, Simone
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 7210 - 7221
  • [40] Global synchromodal shipment matching problem with dynamic and stochastic travel times: a reinforcement learning approach
    Guo, W.
    Atasoy, B.
    Negenborn, R. R.
    ANNALS OF OPERATIONS RESEARCH, 2022,