Long-Run Multi-Robot Planning under Uncertain Action Durations for Persistent Tasks

被引：6

作者：

Azevedo, Carlos ^{[1
]}

Lacerda, Bruno ^{[2
]}

Hawes, Nick ^{[2
]}

Lima, Pedro ^{[1
]}

机构：

[1] Univ Lisbon, Inst Syst & Robot, Inst Super Tecn, Lisbon, Portugal

[2] Univ Oxford, Oxford Robot Inst, Oxford, England

来源：

2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2020年

基金：

英国科研创新办公室; 英国工程与自然科学研究理事会;

关键词：

D O I：

10.1109/IROS45743.2020.9340901

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents an approach for multi-robot long-term planning under uncertainty over the duration of actions. The proposed methodology takes advantage of generalized stochastic Petri nets with rewards (GSPNR) to model multi-robot problems. A GSPNR allows for unified modeling of action selection, uncertainty on the duration of action execution, and for goal specification through the use of transition rewards and rewards per time unit. Our approach relies on the interpretation of the GSPNR model as an equivalent embedded Markov reward automaton (MRA). We then build on a state-of-the-art method to compute the long-run average reward over MRAs, extending it to enable the extraction of the optimal policy. We provide an empirical evaluation of the proposed approach on a simulated multi-robot monitoring problem, evaluating its performance and scalability. The results show that the synthesized policy outperforms a policy obtained from an infinite horizon discounted reward formulation as well as a carefully hand-crafted policy.

引用

页码：4323 / 4328

页数：6

共 18 条

[1] Modeling and Planning with Macro-Actions in Decentralized POMDPs [J].

Amato, Christopher ;

Konidaris, George ;

Kaelbling, Leslie P. ;

How, Jonathan P. .

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2019, 64 :817-859

[2]

Azevedo C., 2020, AAMAS

[3]

Butkova Y., 2017, TACAS

[4]

Chatterjee K, 2011, PROCEEDINGS OF THE TWENTY-SECOND ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P1318

[5] Robot task plan representation by Petri nets: modelling, identification, analysis and execution [J].

Costelha, Hugo ;

Lima, Pedro .

AUTONOMOUS ROBOTS, 2012, 33 (04) :337-360

[6]

Eisentraut Christian, 2013, Application and Theory of Petri Nets and Concurrency. 34th International Conference, PETRI NETS 2013. Proceedings: LNCS 7927, P90, DOI 10.1007/978-3-642-38697-8_6

[7] ANALYSIS OF TIMED AND LONG-RUN OBJECTIVES FOR MARKOV AUTOMATA [J].

Guck, Dennis ;

Hatefi, Hassan ;

Hermanns, Holger ;

Katoen, Joost-Pieter ;

Timmer, Mark .

LOGICAL METHODS IN COMPUTER SCIENCE, 2014, 10 (03)

[8] Petri net based multi-robot task coordination from temporal logic specifications [J].

Lacerda, Bruno ;

Lima, Pedro U. .

ROBOTICS AND AUTONOMOUS SYSTEMS, 2019, 122

[9] Robot Planning Based on Boolean Specifications Using Petri Net Models [J].

Mahulea, Cristian ;

Kloetzer, Marius .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2018, 63 (07) :2218-2225

[10]

Mansouri M., 2019, IJCAI

← 1 2 →