Robot awareness in cooperative mobile robot learning

被引:10
作者
Touzet, CF [1 ]
机构
[1] Oak Ridge Natl Lab, Ctr Engn Sci Adv Res, Div Math & Comp Sci, Oak Ridge, TN 37831 USA
关键词
cooperative robotics; cooperative learning; robot awareness; CMOMMT; lazy reinforcement learning;
D O I
10.1023/A:1008945119734
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of the straight-forward learning approaches in cooperative robotics imply for each learning robot a state space growth exponential in the number of team members. To remedy the exponentially large state space, we propose to investigate a less demanding cooperation mechanism-i.e., various levels of awareness-instead of communication. We define awareness as the perception of other robots locations and actions. We recognize four different levels (or degrees) of awareness which imply different amounts of additional information and therefore have different impacts on the search space size (Theta(0), Theta(1), Theta(N), o(N),(1) where N is the number of robots in the team). There are trivial arguments in favor of avoiding binding the increase of the search space size to the number of team members. We advocate that, by studying the maximum number of neighbor robots in the application context, it is possible to tune the parameters associated with a Theta(1) increase of the search space size and allow good learning performance. We use the cooperative multi-robot observation of multiple moving targets (CMOMMT) application to illustrate our method. We verify that awareness allows cooperation, that cooperation shows better performance than a purely collective behavior and that learned cooperation shows better results than learned collective behavior.
引用
收藏
页码:87 / 97
页数:11
相关论文
共 25 条
[1]  
Aha D., 1997, LAZY LEARNING
[2]  
Balch T., 1994, Autonomous Robots, V1, P27, DOI 10.1007/BF00735341
[3]  
Braitenberg V., 1984, VEHICLES EXPT SYNTHE
[4]  
BROOKS RA, 1991, IJCAI 91
[5]   Cooperative mobile robotics: Antecedents and directions [J].
Cao, YU ;
Fukunaga, AS ;
Kahng, AB .
AUTONOMOUS ROBOTS, 1997, 4 (01) :7-27
[6]  
Dorigo M, 1996, IEEE T SYST MAN CY B, V26, P361
[7]  
HEEMSKERK J, 1996, IEEE SEM SELF LEARN, P12
[8]   Reinforcement learning: A survey [J].
Kaelbling, LP ;
Littman, ML ;
Moore, AW .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :237-285
[9]  
KRETCHMAR RM, 1997, P ICNN 97
[10]  
Kube C. Ronald, 1993, Adaptive Behavior, V2, P189, DOI 10.1177/105971239300200204