An approach to the pursuit problem on a heterogeneous multiagent system using reinforcement learning

被引：50

作者：

Ishiwaka, Y

Sato, T

Kakazu, Y

机构：

[1] Hakodate Natl Coll Technol, Hakodate, Hokkaido, Japan

[2] Future Univ Hakodate, Hakodate, Hokkaido, Japan

[3] Hokkaido Univ, Sapporo, Hokkaido, Japan

来源：

ROBOTICS AND AUTONOMOUS SYSTEMS | 2003年 / 43卷 / 04期

关键词：

pursuit problem; prediction; Q-learning; emergence; heterogeneous multiagent system;

D O I：

10.1016/S0921-8890(03)00040-X

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Cooperation among agents is important for multiagent systems having a shared goal. In this paper, an example of the pursuit problem is studied, in which four hunters collaborate to catch a target. A reinforcement learning algorithm is employed to model how the hunters acquire this cooperative behavior to achieve the task. In order to apply Q-learning, which is one way of reinforcement learning, two kinds of prediction are needed for each hunter agent. One is the location of the other hunter agents and target agent, and the other is the movement direction of the target agent at next time step t. In our treatment we extend the standard problem to systems with heterogeneous agents. One motivation for this is that the target agent and hunter agents have differing abilities. In addition, even though those hunter agents are homogeneous at the beginning of the problem, their abilities become heterogeneous in the learning process. Simulations of this pursuit problem were performed on a continuous action state space, the results of which are displayed, accompanied by a discussion of their outcomes' dependence upon the initial locations of the hunters and the speeds of the hunters and a target. (C) 2003 Elsevier Science B.V. All rights reserved.

引用

页码：245 / 256

页数：12

共 50 条

[31] Reinforcement learning with phased approach for fast learning
Hodohara, Norifumi
Murakami, Yuichi
Nakamura, Shingo
Hashimoto, Shuji
PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 17TH '12), 2012, : 930 - 933
[32] State Augmentation via Self-Supervision in Offline Multiagent Reinforcement Learning
Wang, Siying
Li, Xiaodie
Qu, Hong
Chen, Wenyu
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (03) : 1051 - 1062
[33] Online control of stencil printing parameters using reinforcement learning approach
Khader, Nourma
Yoon, Sang Won
28TH INTERNATIONAL CONFERENCE ON FLEXIBLE AUTOMATION AND INTELLIGENT MANUFACTURING (FAIM2018): GLOBAL INTEGRATION OF INTELLIGENT MANUFACTURING AND SMART INDUSTRY FOR GOOD OF HUMANITY, 2018, 17 : 94 - 101
[34] Improved decision making in multiagent system for diagnostic application using cooperative learning algorithms
Vidhate D.A.
Kulkarni P.
International Journal of Information Technology, 2018, 10 (2) : 201 - 209
[35] Reinforcement Learning approaches to Economic Dispatch problem
Jasmin, E. A.
Ahamed, T. P. Imthias
Raj, V. P. Jagathy
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2011, 33 (04) : 836 - 845
[36] A Reinforcement Learning Approach for Solving Integrated Mass Customization Process Planning and Job-Shop Scheduling Problem in a Reconfigurable Manufacturing System
Gao, Sini
Daaboul, Joanna
Le Duigou, Julien
12TH INTERNATIONAL WORKSHOP ON SERVICE ORIENTED, HOLONIC AND MULTI-AGENT MANUFACTURING SYSTEMS FOR INDUSTRY OF THE FUTURE, SOHOMA 2022, 2023, 1083 : 395 - 406
[37] Contracts for Difference: A Reinforcement Learning Approach
Zengeler, Nico
Handmann, Uwe
JOURNAL OF RISK AND FINANCIAL MANAGEMENT, 2020, 13 (04)
[38] SQIX: QMIX Algorithm Activated by General Softmax Operator for Cooperative Multiagent Reinforcement Learning
Zhang, Miaomiao
Tong, Wei
Zhu, Guangyu
Xu, Xin
Wu, Edmond Q.
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (11): : 6550 - 6560
[39] Distributed Actor-Critic Algorithms for Multiagent Reinforcement Learning Over Directed Graphs
Dai, Pengcheng
Yu, Wenwu
Wang, He
Baldi, Simone
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 7210 - 7221
[40] Global synchromodal shipment matching problem with dynamic and stochastic travel times: a reinforcement learning approach
Guo, W.
Atasoy, B.
Negenborn, R. R.
ANNALS OF OPERATIONS RESEARCH, 2022,

← 1 2 3 4 5 →