Using communication to reduce locality in distributed multiagent learning

被引：52

作者：

Mataric, MJ ^{[1
]}

机构：

[1] Univ So Calif, Dept Comp Sci, Los Angeles, CA 90089 USA

来源：

JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE | 1998年 / 10卷 / 03期

基金：

美国国家科学基金会;

关键词：

robotics; machine learning; distributed AI; multi-agent systems; communication between agents;

D O I：

10.1080/095281398146806

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper attempts to bridge the fields of machine learning, robotics, and distributed AI. It discusses the use of communication in reducing the undesirable effects of locality in fully distributed multi-agent systems with multiple agents/robots learning in parallel while interacting with each other. Two key problems, hidden state and credit assignment, are addressed by applying local undirected broadcast communication in a dual role: as sensing and as reinforcement. The methodology is demonstrated on two multi-robot learning experiments. The first describes learning a tightly-coupled coordination task with two robots, the second a loosely-coupled task with four robots learning social rules. Communication is used to (1)share sensory data to overcome hidden state and (2) share reinforcement to overcome the credit assignment problem between the agents and bridge the gap between local/individual and global/group pay-off.

引用

页码：357 / 369

页数：13

共 26 条

[1]

ALTENBURG K, 1993, P IJCAI 93 WORKSH DY, P96

[2]

ASADA M, 1995, P MACH LEARN C WORKS, P1

[3]

Axelrod R, 2006, EVOLUTION COOPERATIO

[4]

DONALD BR, 1993, P INT S ROB RES HIDD

[5]

DUDEK G, 1993, P IJCAI 93 WORKSH DY, P101

[6]

HORSWILL ID, 1993, THESIS MIT

[7]

KAELBLING LP, 1990, THESIS STANFORD U

[8]

LITTMAN ML, 1994, P 11 INT C MACH LEAR, P157

[9]

Mahadevan S, 1996, AI MAG, V17, P89

[10]

MAHADEVAN S, 1991, P AAAI 91, P8

← 1 2 3 →