Learning Optimal Navigation Actions for Foresighted Robot Behavior During Assistance Tasks

被引:0
作者
Bayoumi, AbdElMoniem [1 ]
Bennewitz, Maren [1 ]
机构
[1] Univ Bonn, Inst Comp Sci, Bonn, Germany
来源
2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2016年
关键词
TRACKING; TARGET;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present an approach to learn optimal navigation actions for assistance tasks in which the robot aims at efficiently reaching the final navigation goal of a human where service has to be provided. Always following the human at a close distance might hereby result in inefficient trajectories, since people regularly do not move on the shortest path to their destination (e. g., they move to grab the phone or make a note). Therefore, a service robot should infer the human's intended navigation goal and compute its own motion based on that prediction. We developed an approach that applies reinforcement learning to get a Q-function that determines for each pair of the robot's and human's relative positions the best navigation action for the robot. Our approach applies a prediction of the human's motion based on a softened Markov decision process (MDP). This MDP is independent from the navigation learning framework and is learned beforehand on previously observed trajectories. We thoroughly evaluated our method in simulation and on a real robot. As the experimental results show, our approach leads to foresighted navigation behavior and significantly reduces the path length and completion time compared to naive following strategies.
引用
收藏
页码:207 / 212
页数:6
相关论文
共 15 条
  • [1] [Anonymous], T CONTROL AUTOMATION
  • [2] [Anonymous], 2020, Reinforcement Learning, An Introduction
  • [3] Learning motion patterns of people for compliant robot motion
    Bennewitz, M
    Burgard, W
    Cielniak, G
    Thrun, S
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2005, 24 (01) : 31 - 48
  • [4] Goldhoorn A., 2014, P IEEE RAS INT C HUM
  • [5] Robot team coordination for target tracking using fuzzy logic controller in game theoretic framework
    Harmati, Istvan
    Skrzypczyk, Krzysztof
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2009, 57 (01) : 75 - 86
  • [6] Control approach for tracking a moving target by a wheeled mobile robot with limited velocities
    Huang, L.
    [J]. IET CONTROL THEORY AND APPLICATIONS, 2009, 3 (12) : 1565 - 1577
  • [7] Kuderer M., 2014, LECT NOTES COMPUTER, V8755
  • [8] Laugier C., 2007, STAR, P45
  • [9] Morales Y., 2012, ACM IEEE INT C HUM R
  • [10] Multi-robot nonlinear model predictive formation control: Moving target and target absence
    Nascimento, Tiago P.
    Moreira, Antonio Paulo
    Scolari Conceicao, Andre G.
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2013, 61 (12) : 1502 - 1515