Distributed learning and cooperative control for multi-agent systems

被引:185
作者
Choi, Jongeun [1 ,2 ]
Oh, Songhwai [3 ]
Horowitz, Roberto [4 ]
机构
[1] Michigan State Univ, Dept Mech Engn, E Lansing, MI 48824 USA
[2] Michigan State Univ, Dept Elect & Comp Engn, E Lansing, MI 48824 USA
[3] Seoul Natl Univ, Sch Elect Engn & Comp Sci, Seoul, South Korea
[4] Univ Calif Berkeley, Dept Mech Engn, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
Cooperative control; Multi-agent systems; Recursive parameter estimation; SENSOR NETWORKS; COORDINATION; ALGORITHMS;
D O I
10.1016/j.automatica.2009.09.025
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents an algorithm and analysis of distributed learning and cooperative control for a multi-agent system so that a global goal of the overall system can be achieved by locally acting agents. We consider a resource-constrained multi-agent system, in which each agent has limited capabilities in terms of sensing, computation, and communication. The proposed algorithm is executed by each agent independently to estimate an unknown field of interest from noisy measurements and to coordinate multiple agents in a distributed manner to discover peaks of the unknown field. Each mobile agent maintains its own local estimate of the field and updates the estimate using collective measurements from itself and nearby agents. Each agent then moves towards peaks of the field using the gradient of its estimated field while avoiding collision and maintaining communication connectivity. The proposed algorithm is based on a recursive spatial estimation of an unknown field. We show that the closed-loop dynamics of the proposed multi-agent system can be transformed into a form of a stochastic approximation algorithm and prove its convergence using Ljung's ordinary differential equation (ODE) approach. We also present extensive simulation results supporting our theoretical results. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2802 / 2814
页数:13
相关论文
共 37 条
[1]  
ADLER J, 1966, J SUPRAMOLECULAR STR, V4, P305
[2]  
[Anonymous], 1991, STAT SPATIAL DATA
[3]  
[Anonymous], 2001, SWARM INTELLIGENCE J, DOI DOI 10.1007/S00897020553A
[4]  
[Anonymous], IEEE T CONT IN PRESS
[5]  
[Anonymous], 1999, Swarm Intelligence
[6]  
Astrom K.J., 1995, ADAPTIVE CONTROL
[7]  
BRUS L, 2006, RECURSIVE BLACK BOX
[8]  
CHOI J, 2008, P INT FED AUT CONTR
[9]  
CHOI J, 2007, P 46 IEEE C DEC CONT
[10]  
CHOI J, 2008, P 2008 AM CONTR C AC