An approach to the pursuit problem on a heterogeneous multiagent system using reinforcement learning

Cited by: 50

Authors:

Ishiwaka, Y

Sato, T

Kakazu, Y

Affiliations:

[1] Hakodate Natl Coll Technol, Hakodate, Hokkaido, Japan

[2] Future Univ Hakodate, Hakodate, Hokkaido, Japan

[3] Hokkaido Univ, Sapporo, Hokkaido, Japan

Source:

ROBOTICS AND AUTONOMOUS SYSTEMS | 2003 / Vol. 43 / No. 04

Keywords:

pursuit problem; prediction; Q-learning; emergence; heterogeneous multiagent system;

DOI:

10.1016/S0921-8890(03)00040-X

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Cooperation among agents is important for multiagent systems having a shared goal. In this paper, an example of the pursuit problem is studied, in which four hunters collaborate to catch a target. A reinforcement learning algorithm is employed to model how the hunters acquire this cooperative behavior to achieve the task. In order to apply Q-learning, which is one way of reinforcement learning, two kinds of prediction are needed for each hunter agent. One is the location of the other hunter agents and target agent, and the other is the movement direction of the target agent at next time step t. In our treatment we extend the standard problem to systems with heterogeneous agents. One motivation for this is that the target agent and hunter agents have differing abilities. In addition, even though those hunter agents are homogeneous at the beginning of the problem, their abilities become heterogeneous in the learning process. Simulations of this pursuit problem were performed on a continuous action state space, the results of which are displayed, accompanied by a discussion of their outcomes' dependence upon the initial locations of the hunters and the speeds of the hunters and a target. (C) 2003 Elsevier Science B.V. All rights reserved.
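The abstract names Q-learning as the learning rule for each hunter. As a rough illustration only, the sketch below shows the generic tabular Q-learning update with epsilon-greedy action selection. It is not the authors' method: the paper works on a continuous action state space and adds two kinds of prediction, neither of which is modeled here. The action set, the state encoding as grid tuples, and the parameter values `alpha`, `gamma`, and `epsilon` are all assumptions made for the example.

```python
import random

# Hypothetical discrete action set; the paper itself uses a continuous space.
ACTIONS = ["up", "down", "left", "right"]


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Apply one standard Q-learning update and return the new Q-value.

    q is a dict mapping (state, action) pairs to values; missing entries count as 0.
    """
    best_next = max(q.get((next_state, a), 0.0) for a in ACTIONS)
    old = q.get((state, action), 0.0)
    # Move the old estimate toward the bootstrapped target by step size alpha.
    q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
    return q[(state, action)]


def choose_action(q, state, epsilon=0.1, rng=random):
    """Epsilon-greedy selection: explore with probability epsilon, else exploit."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q.get((state, a), 0.0))
```

In a multi-hunter setting, one simple arrangement would be for each hunter to keep its own table `q`, which is one way initially identical hunters could come to behave differently as learning proceeds.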


Pages: 245 - 256

Number of pages: 12

50 records in total

[21] An Improved Reinforcement Learning System Using Affective Factors
Kuremoto, Takashi
Tsurusaki, Tetsuya
Kobayashi, Kunikazu
Mabu, Shingo
Obayashi, Masanao
ROBOTICS, 2013, 2 (03) : 149 - 164
[22] Adaptive Individual Q-Learning-A Multiagent Reinforcement Learning Method for Coordination Optimization
Zhang, Zhen
Wang, Dongqing
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 12
[23] Multiagent-Based Reinforcement Learning for Optimal Reactive Power Dispatch
Xu, Yinliang
Zhang, Wei
Liu, Wenxin
Ferrese, Frank
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06) : 1742 - 1751
[24] Reinforcement Learning Approach to Feedback Stabilization Problem of Probabilistic Boolean Control Networks
Acernese, Antonio
Yerudkar, Amol
Glielmo, Luigi
Del Vecchio, Carmen
IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (01) : 337 - 342
[25] FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks
Zhang, Zhen
Zhao, Dongbin
Gao, Junwei
Wang, Dongqing
Dai, Yujie
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (06) : 1367 - 1379
[26] Fuzzy Graph and Collective Multiagent Reinforcement Learning for Traffic Signals Control
Abdoos, Monireh
IEEE INTELLIGENT SYSTEMS, 2021, 36 (04) : 48 - 55
[27] VGN: Value Decomposition With Graph Attention Networks for Multiagent Reinforcement Learning
Wei, Qinglai
Li, Yugu
Zhang, Jie
Wang, Fei-Yue
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 182 - 195
[28] A reinforcement learning approach to dairy farm battery management using Q learning
Ali, Nawazish
Wahid, Abdul
Shaw, Rachael
Mason, Karl
JOURNAL OF ENERGY STORAGE, 2024, 93
[29] A Forecasting Approach to Cryptocurrency Price Index Using Reinforcement Learning
Mariappan, L. Thanga
Pandian, J. Arun
Kumar, V. Dhilip
Geman, Oana
Chiuchisan, Iuliana
Nastase, Carmen
APPLIED SCIENCES-BASEL, 2023, 13 (04)
[30] A reinforcement learning based computational intelligence approach for binary optimization problems: The case of the set-union knapsack problem
Ozsoydan, Fehmi Burcin
Golcuk, Ilker
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 118
