Policy search for multi-robot coordination under uncertainty

被引：32

作者：

Amato, Christopher ^{[1
]}

Konidaris, George ^{[2
]}

Anders, Ariel ^{[3
]}

Cruz, Gabriel ^{[3
]}

How, Jonathan P. ^{[4
]}

Kaelbling, Leslie P. ^{[3
]}

机构：

[1] Northeastern Univ, Coll Comp & Informat Sci, 360 Huntington Ave, Boston, MA 02115 USA

[2] Brown Univ, Dept Comp Sci, Providence, RI 02912 USA

[3] MIT, CSAIL, Cambridge, MA 02139 USA

[4] MIT, LIDS, Cambridge, MA 02139 USA

来源：

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH | 2016年 / 35卷 / 14期

基金：

美国国家科学基金会;

关键词：

AI reasoning methods; autonomous agents; distributed robot systems; DECENTRALIZED CONTROL; FRAMEWORK; MOTION;

D O I：

10.1177/0278364916679611

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

We introduce a principled method for multi-robot coordination based on a general model (termed a MacDec-POMDP) of multi-robot cooperative planning in the presence of stochasticity, uncertain sensing, and communication limitations. A new MacDec-POMDP planning algorithm is presented that searches over policies represented as finite-state controllers, rather than the previous policy tree representation. Finite-state controllers can be much more concise than trees, are much easier to interpret, and can operate over an infinite horizon. The resulting policy search algorithm requires a substantially simpler simulator that models only the outcomes of executing a given set of motor controllers, not the details of the executions themselves and can solve significantly larger problems than existing MacDec-POMDP planners. We demonstrate significant performance improvements over previous methods and show that our method can be used for actual multi-robot systems through experiments on a cooperative multi-robot bartending domain.

引用

页码：1760 / 1778

页数：19

共 39 条

[1]

Amato C, 2014, AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, P1273

[2]

Amato C, 2015, IEEE INT CONF ROBOT, P1241, DOI 10.1109/ICRA.2015.7139350

[3]

Amato C, 2013, IEEE DECIS CONTR P, P2398, DOI 10.1109/CDC.2013.6760239

[4] Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs [J].

Amato, Christopher ;

Bernstein, Daniel S. ;

Zilberstein, Shlomo .

AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2010, 21 (03) :293-320

[5]

[Anonymous], 2010, Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence

[6]

[Anonymous], 2013, P 27 AAAI C ARTIFICI

[7]

[Anonymous], NEURAL INFORM PROCES

[8]

[Anonymous], 2009, INT C AUT AG MULT SY

[9]

[Anonymous], 2005, INT J HUMANOID ROB, DOI DOI 10.1142/S0219843605000545

[10] Integrated perception and planning in the continuous space: A POMDP approach [J].

Bai, Haoyu ;

Hsu, David ;

Lee, Wee Sun .

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2014, 33 (09) :1288-1302

← 1 2 3 4 →