Multi-robot systems with agent-based reinforcement learning: evolution, opportunities and challenges

被引:7
作者
Yang, Erfu [1 ]
Gu, Dongbing [1 ]
机构
[1] Univ Essex, Sch Comp Sci & Elect Engn, Wivenhoe Pk, Colchester CO4 3SQ, Essex, England
基金
英国工程与自然科学研究理事会;
关键词
multi-robot systems; MRSs; reinforcement learning; multi-agent systems; stochastic games; approximation and generalisation; fuzzy logic; survey;
D O I
10.1504/IJMIC.2009.024735
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-agent reinforcement learning for multi-robot systems is a challenging issue in both robotics and artificial intelligence. With the ever increasing interests in theoretical researches and practical applications, currently there have been a lot of efforts towards providing good solutions to this challenge. However, there are still many difficulties in scaling up multi- agent reinforcement learning to multi-robot systems. This paper presents a survey on the evolution, opportunities and challenges of applying agent-based reinforcement learning to multi- robot systems. After reviewing some important advances in this field, some challenging problems and promising research directions are focused on. A concluding remark is made from the perspectives of the authors.
引用
收藏
页码:271 / 286
页数:16
相关论文
共 80 条
[1]   Multiagent reinforcement learning using function approximation [J].
Abul, O ;
Polat, F ;
Alhajj, R .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2000, 30 (04) :485-497
[2]   Expertness based cooperative Q-learning [J].
Ahmadabadi, MN ;
Asadpour, M .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2002, 32 (01) :66-76
[3]  
[Anonymous], 1999, THEORY LEARNING GAME
[4]  
[Anonymous], P IEEE 2005 S COMP I
[5]   Cooperative behavior acquisition for mobile robots in dynamically changing real worlds via vision-based reinforcement learning and development [J].
Asada, M ;
Uchibe, E ;
Hosoda, K .
ARTIFICIAL INTELLIGENCE, 1999, 110 (02) :275-292
[6]   Behavior-based formation control for multirobot teams [J].
Balch, T ;
Arkin, RC .
IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1998, 14 (06) :926-939
[7]  
Banerjee B., 2003, P 2 INT JOINT C AUT, P686, DOI [10.1145/860575.860686, DOI 10.1145/860575.860686]
[8]  
Banerjee B., 2002, 13 EUR C MACH LEARN, P686
[9]  
Basar T, 1982, DYNAMIC NONCOOPERATI
[10]  
Berenji H. R., 2000, IIS0010