Multi-robot systems with agent-based reinforcement learning: evolution, opportunities and challenges

Cited by: 7

Authors:

Yang, Erfu [1]

Gu, Dongbing [1]

Affiliations:

[1] Univ Essex, Sch Comp Sci & Elect Engn, Wivenhoe Pk, Colchester CO4 3SQ, Essex, England

Source:

INTERNATIONAL JOURNAL OF MODELLING IDENTIFICATION AND CONTROL | 2009 / Vol. 6 / No. 04

Funding:

Engineering and Physical Sciences Research Council (EPSRC), UK;

Keywords:

multi-robot systems; MRSs; reinforcement learning; multi-agent systems; stochastic games; approximation and generalisation; fuzzy logic; survey;

DOI:

10.1504/IJMIC.2009.024735

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Multi-agent reinforcement learning for multi-robot systems is a challenging issue in both robotics and artificial intelligence. With the ever increasing interests in theoretical researches and practical applications, currently there have been a lot of efforts towards providing good solutions to this challenge. However, there are still many difficulties in scaling up multi-agent reinforcement learning to multi-robot systems. This paper presents a survey on the evolution, opportunities and challenges of applying agent-based reinforcement learning to multi-robot systems. After reviewing some important advances in this field, some challenging problems and promising research directions are focused on. A concluding remark is made from the perspectives of the authors.

Citation

Pages: 271-286

Page count: 16

80 references in total

[1] Multiagent reinforcement learning using function approximation [J].

Abul, O ;

Polat, F ;

Alhajj, R .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2000, 30 (04) :485-497

[2] Expertness based cooperative Q-learning [J].

Ahmadabadi, MN ;

Asadpour, M .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2002, 32 (01) :66-76

[3]

[Anonymous], 1999, THEORY LEARNING GAME

[4]

[Anonymous], P IEEE 2005 S COMP I

[5] Cooperative behavior acquisition for mobile robots in dynamically changing real worlds via vision-based reinforcement learning and development [J].

Asada, M ;

Uchibe, E ;

Hosoda, K .

ARTIFICIAL INTELLIGENCE, 1999, 110 (02) :275-292

[6] Behavior-based formation control for multirobot teams [J].

Balch, T ;

Arkin, RC .

IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1998, 14 (06) :926-939

[7]

Banerjee B., 2003, P 2 INT JOINT C AUT, P686, DOI 10.1145/860575.860686

[8]

Banerjee B., 2002, 13 EUR C MACH LEARN, P686

[9]

Basar T, 1982, DYNAMIC NONCOOPERATI

[10]

Berenji H. R., 2000, IIS0010
