Dec-MCTS: Decentralized planning for multi-robot active perception

被引：123

作者：

Best, Graeme ^{[1
]}

Cliff, Oliver M. ^{[1
]}

Patten, Timothy ^{[1
,2
]}

Mettu, Ramgopal R. ^{[3
]}

Fitch, Robert ^{[1
,4
]}

机构：

[1] Univ Sydney, ACFR, J04, Sydney, NSW 2006, Australia

[2] Vienna Univ Technol, Automat & Control Inst, Vienna, Austria

[3] Tulane Univ, Dept Comp Sci, New Orleans, LA 70118 USA

[4] Univ Technol Sydney, Ctr Autonomous Syst, Sydney, NSW, Australia

来源：

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH | 2019年 / 38卷 / 2-3期

基金：

澳大利亚研究理事会;

关键词：

Decentralized Monte Carlo tree search; multi-robot systems; decentralized planning; active perception; variational methods; ORIENTEERING PROBLEM; APPROXIMATIONS; COMMUNICATION; EXPLORATION; INFERENCE; OBJECTS;

D O I：

10.1177/0278364918755924

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

We propose a decentralized variant of Monte Carlo tree search (MCTS) that is suitable for a variety of tasks in multi-robot active perception. Our algorithm allows each robot to optimize its own actions by maintaining a probability distribution over plans in the joint-action space. Robots periodically communicate a compressed form of their search trees, which are used to update the joint distribution using a distributed optimization approach inspired by variational methods. Our method admits any objective function defined over robot action sequences, assumes intermittent communication, is anytime, and is suitable for online replanning. Our algorithm features a new MCTS tree expansion policy that is designed for our planning scenario. We extend the theoretical analysis of standard MCTS to provide guarantees for convergence rates to the optimal payoff sequence. We evaluate the performance of our method for generalized team orienteering and online active object recognition using real data, and show that it compares favorably to centralized MCTS even with severely degraded communication. These examples demonstrate the suitability of our algorithm for real-world active perception with multiple robots.

引用

页码：316 / 337

页数：22

共 91 条

[1] Persistent monitoring in discrete environments: Minimizing the maximum weighted latency between observations [J].

Alamdari, Soroush ;

Fata, Elaheh ;

Smith, Stephen L. .

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2014, 33 (01) :138-154

[2]

[Anonymous], THESIS U SYDNEY

[3]

[Anonymous], 2006, Technical report

[4]

[Anonymous], UNMANNED SYSTEMS

[5]

[Anonymous], 2014, J ARTIF INTELL RES

[6]

[Anonymous], 8936 U MICH DEP IND

[7]

[Anonymous], P INT JOINT C AUT AG

[8]

[Anonymous], P ARAA AUSTR C ROB A

[9]

[Anonymous], P ROB SCI SYST

[10]

[Anonymous], P INT WORKSH ALG FDN

← 1 2 3 4 5 6 7 8 9 10 →