Robot Awareness in Cooperative Mobile Robot Learning

被引：0

作者：

Claude F. Touzet

机构：

[1] Oak Ridge National Laboratory,Center for Engineering Science Advanced Research, Computer Science and Mathematics Division

来源：

Autonomous Robots | 2000年 / 8卷

关键词：

cooperative robotics; cooperative learning; robot awareness; CMOMMT; lazy reinforcement learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Most of the straight-forward learning approaches in cooperative robotics imply for each learning robot a state space growth exponential in the number of team members. To remedy the exponentially large state space, we propose to investigate a less demanding cooperation mechanism—i.e., various levels of awareness—instead of communication. We define awareness as the perception of other robots locations and actions. We recognize four different levels (or degrees) of awareness which imply different amounts of additional information and therefore have different impacts on the search space size (Θ(0), Θ(1), Θ(N), o(N),1 where N is the number of robots in the team). There are trivial arguments in favor of avoiding binding the increase of the search space size to the number of team members. We advocate that, by studying the maximum number of neighbor robots in the application context, it is possible to tune the parameters associated with a Θ(1) increase of the search space size and allow good learning performance. We use the cooperative multi-robot observation of multiple moving targets (CMOMMT) application to illustrate our method. We verify that awareness allows cooperation, that cooperation shows better performance than a purely collective behavior and that learned cooperation shows better results than learned collective behavior.

引用

页码：87 / 97

页数：10

共 15 条

[1]

Balch T.(1994)Communication in reactive multiagent robotic systems Autonomous Robots 1 27-52

[2]

Arkin R.(1997)Cooperative mobile robotics: Antecedent and directions Autonomous Robots 4 7-27

[3]

Cao Y.U.(1996)Reinforcement learning: A survey Journal of Artificial Intelligence Research 4 237-285

[4]

Fukuaga A.(1994)Collective robotics: From social insects to robots Adaptive Behavior 2 189-218

[5]

Kahng A.(1992)Self-improving reactive agents based on reinforcement learning, planning and teaching Machine Learning 8 293-321

[6]

Kaelbling L.(1997)Reinforcement learning in multi-robot domain Autonomous Robots 4 73-83

[7]

Littman M.(1997)Learning social behavior Robotics and Autonomous Systems 20 191-204

[8]

Moore A.(1999)Exploration tuned reinforcement function Neurocomputing 28 93-105

[9]

Kube R.(undefined)undefined undefined undefined undefined-undefined

[10]

Zhang H.(undefined)undefined undefined undefined undefined-undefined

← 1 2 →