Improving Multi-Robot Behavior Using Learning-Based Receding Horizon Task Allocation

被引:0
作者
Schillinger, Philipp [1 ,2 ,3 ]
Buerger, Mathias [1 ]
Dimarogonas, Dimos, V [2 ,3 ]
机构
[1] Bosch Ctr Artificial Intelligence, Renningen, Germany
[2] KTH Royal Inst Technol, KTH Ctr Autonomous Syst, Stockholm, Sweden
[3] KTH Royal Inst Technol, ACCESS Linnaeus Ctr EECS, Stockholm, Sweden
来源
ROBOTICS: SCIENCE AND SYSTEMS XIV | 2018年
基金
欧盟地平线“2020”; 瑞典研究理事会;
关键词
MARKOV DECISION-PROCESSES; DECENTRALIZED CONTROL; MULTIAGENT; COORDINATION;
D O I
暂无
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Planning efficient and coordinated policies for a team of robots is a computationally demanding problem, especially when the system faces uncertainty in the outcome or duration of actions. In practice, approximation methods are usually employed to plan reasonable team policies in an acceptable time. At the same time, many typical robotic tasks include a repetitive pattern. On the one hand, this multiplies the increased cost of inefficient solutions. But on the other hand, it also provides the potential for improving an initial, inefficient solution over time. In this paper, we consider the case that a single mission specification is given to a multi-robot system, describing repetitive tasks which allow the robots to parallelize work. We propose here a decentralized coordination scheme which enables the robots to decompose the full specification, execute distributed tasks, and improve their strategy over time.
引用
收藏
页数:10
相关论文
共 42 条
[1]  
Aksaray D, 2016, IEEE DECIS CONTR P, P6565, DOI 10.1109/CDC.2016.7799279
[2]   Policy search for multi-robot coordination under uncertainty [J].
Amato, Christopher ;
Konidaris, George ;
Anders, Ariel ;
Cruz, Gabriel ;
How, Jonathan P. ;
Kaelbling, Leslie P. .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2016, 35 (14) :1760-1778
[3]  
[Anonymous], 2003, Advances in neural information processing systems
[4]  
[Anonymous], 2005, Robotics: Science and Systems
[5]  
[Anonymous], ARXIV170307887
[6]  
[Anonymous], 2017, arXiv
[7]  
[Anonymous], 2015, Reinforcement Learning: An Introduction
[8]  
[Anonymous], 2017, ARXIV170602275
[9]  
Baier C, 2008, PRINCIPLES OF MODEL CHECKING, P1
[10]  
Belta C, 2017, STUD SYST DECIS CONT, V89, P1, DOI 10.1007/978-3-319-50763-7