Multiagent Learning of Coordination in Loosely Coupled Multiagent Systems

Cited by: 40

Authors:

Yu, Chao [1]

Zhang, Minjie [2]

Ren, Fenghui [2]

Tan, Guozhen [1]

Affiliations:

[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116024, Peoples R China

[2] Univ Wollongong, Sch Comp Sci & Software Engn, Wollongong, NSW 2522, Australia

Source:

IEEE TRANSACTIONS ON CYBERNETICS | 2015 / Vol. 45 / No. 12

Keywords:

Agent independence; coordination; multiagent learning (MAL); reinforcement learning (RL); sparse interactions; DECENTRALIZED CONTROL; COMPLEXITY;

DOI:

10.1109/TCYB.2014.2387277

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Multiagent learning (MAL) is a promising technique for agents to learn efficient coordinated behaviors in multiagent systems (MASs). In MAL, multiple concurrent distributed learning processes can make the learning environment nonstationary for each individual learner. Developing an efficient learning approach to coordinate agents' behaviors in this dynamic environment is a difficult problem, especially when agents do not know the domain structure and have only local observability of the environment. In this paper, a coordinated MAL approach is proposed to enable agents to learn efficient coordinated behaviors by exploiting agent independence in loosely coupled MASs. The main feature of the proposed approach is to explicitly quantify and dynamically adapt agent independence during learning so that agents can make a trade-off between a single-agent learning process and a coordinated learning process for efficient decision making. The proposed approach is employed to solve two-robot navigation problems in domains of different scales. Experimental results show that agents using the proposed approach can learn to act in concert or independently in different areas of the environment, which results in great computational savings and near-optimal performance.


Pages: 2853-2867

Number of pages: 15

