Improving Multi-Robot Behavior Using Learning-Based Receding Horizon Task Allocation

Cited by: 0

Authors:

Schillinger, Philipp [1,2,3]

Buerger, Mathias [1]

Dimarogonas, Dimos V. [2,3]

Affiliations:

[1] Bosch Ctr Artificial Intelligence, Renningen, Germany

[2] KTH Royal Inst Technol, KTH Ctr Autonomous Syst, Stockholm, Sweden

[3] KTH Royal Inst Technol, ACCESS Linnaeus Ctr EECS, Stockholm, Sweden

Source:

ROBOTICS: SCIENCE AND SYSTEMS XIV | 2018

Funding:

EU Horizon 2020; Swedish Research Council;

Keywords:

MARKOV DECISION-PROCESSES; DECENTRALIZED CONTROL; MULTIAGENT; COORDINATION;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP24 [Robotics];

Discipline classification code:

080202; 1405;

Abstract:

Planning efficient and coordinated policies for a team of robots is a computationally demanding problem, especially when the system faces uncertainty in the outcome or duration of actions. In practice, approximation methods are usually employed to plan reasonable team policies in an acceptable time. At the same time, many typical robotic tasks include a repetitive pattern. On the one hand, this multiplies the increased cost of inefficient solutions. But on the other hand, it also provides the potential for improving an initial, inefficient solution over time. In this paper, we consider the case that a single mission specification is given to a multi-robot system, describing repetitive tasks which allow the robots to parallelize work. We propose here a decentralized coordination scheme which enables the robots to decompose the full specification, execute distributed tasks, and improve their strategy over time.

Pages: 10
