Increasing Self-Adaptation in a Hybrid Decision-Making and Planning System with Reinforcement Learning

被引:5
作者
Hrabia, Christopher-Eyk [1 ]
Lehmann, Patrick Marvin [1 ]
Albayrak, Sahin [1 ]
机构
[1] Tech Univ Berlin, DAI Lab, Ernst Reuter Pl 7, D-10587 Berlin, Germany
来源
2019 IEEE 43RD ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 1 | 2019年
关键词
decision-making; planning; reinforcement learning; self-adaptation; autonomous robots; GAME; GO;
D O I
10.1109/COMPSAC.2019.00073
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Task-level decision-making and AT planning are used to control autonomous robots from a high-level, mission-oriented perspective. The dynamic selection of most suitable actions allows the system to adapt to changes in the environment as well as its own state. Nevertheless, decision-making and AT planning often require a priori definitions of capabilities, rules, decision models, or world knowledge. Due to the challenge of handling the uncertainty of robot applications in dynamic and uncontrolled environments such definitions or descriptions are always incomplete, hence the possible adaptation capabilities are limited. In this paper, we present how the self-adaptation of a robot planning and decision-making system can be improved by incorporating reinforcement learning. Particularly, we show our approach of integrating deep reinforcement learning into the ROS Hybrid Behaviour Planner (RHBP).
引用
收藏
页码:469 / 478
页数:10
相关论文
共 28 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] [Anonymous], 2003, P 3 IEEE RAS INT C H
  • [3] [Anonymous], 1998, REINFORCEMENT LEARNI
  • [4] The Arcade Learning Environment: An Evaluation Platform for General Agents
    Bellemare, Marc G.
    Naddaf, Yavar
    Veness, Joel
    Bowling, Michael
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2013, 47 : 253 - 279
  • [5] The SMACH High-Level Executive
    Boren, Jonathan
    Cousins, Steve
    [J]. IEEE ROBOTICS & AUTOMATION MAGAZINE, 2010, 17 (04) : 18 - 20
  • [6] Cashmore M, 2015, P I C AUTOMAT PLAN S, P333
  • [7] Decugis V, 1998, FROM ANIM ANIMAT, P153
  • [8] Foukarakis Michalis, 2014, Universal Access in Human-Computer Interaction. Aging and Assistive Environments. 8th International Conference, UAHCI 2014, Held as Part of HCI International 2014. Proceedings: LNCS 8515, P625, DOI 10.1007/978-3-319-07446-7_60
  • [9] The Metric-FF planning system: Translating "ignoring delete lists" to numeric state variables
    Hoffmann, J
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2003, 20 : 291 - 341
  • [10] Hrabia C.-E., 2017, COMBINING SELF ORG D, P385