Increasing Self-Adaptation in a Hybrid Decision-Making and Planning System with Reinforcement Learning

被引：5

作者：

Hrabia, Christopher-Eyk ^{[1
]}

Lehmann, Patrick Marvin ^{[1
]}

Albayrak, Sahin ^{[1
]}

机构：

[1] Tech Univ Berlin, DAI Lab, Ernst Reuter Pl 7, D-10587 Berlin, Germany

来源：

2019 IEEE 43RD ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 1 | 2019年

关键词：

decision-making; planning; reinforcement learning; self-adaptation; autonomous robots; GAME; GO;

D O I：

10.1109/COMPSAC.2019.00073

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Task-level decision-making and AT planning are used to control autonomous robots from a high-level, mission-oriented perspective. The dynamic selection of most suitable actions allows the system to adapt to changes in the environment as well as its own state. Nevertheless, decision-making and AT planning often require a priori definitions of capabilities, rules, decision models, or world knowledge. Due to the challenge of handling the uncertainty of robot applications in dynamic and uncontrolled environments such definitions or descriptions are always incomplete, hence the possible adaptation capabilities are limited. In this paper, we present how the self-adaptation of a robot planning and decision-making system can be improved by incorporating reinforcement learning. Particularly, we show our approach of integrating deep reinforcement learning into the ROS Hybrid Behaviour Planner (RHBP).

引用

页码：469 / 478

页数：10

共 28 条

[1]

Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265

[2]

[Anonymous], 2003, P 3 IEEE RAS INT C H

[3]

[Anonymous], 1998, REINFORCEMENT LEARNI

[4] The Arcade Learning Environment: An Evaluation Platform for General Agents [J].

Bellemare, Marc G. ;

Naddaf, Yavar ;

Veness, Joel ;

Bowling, Michael .

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2013, 47 :253-279

[5] The SMACH High-Level Executive [J].

Boren, Jonathan ;

Cousins, Steve .

IEEE ROBOTICS & AUTOMATION MAGAZINE, 2010, 17 (04) :18-20

[6]

Cashmore M, 2015, P I C AUTOMAT PLAN S, P333

[7]

Decugis V, 1998, FROM ANIM ANIMAT, P153

[8]

Foukarakis Michalis, 2014, Universal Access in Human-Computer Interaction. Aging and Assistive Environments. 8th International Conference, UAHCI 2014, Held as Part of HCI International 2014. Proceedings: LNCS 8515, P625, DOI 10.1007/978-3-319-07446-7_60

[9] The Metric-FF planning system: Translating "ignoring delete lists" to numeric state variables [J].

Hoffmann, J .

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2003, 20 :291-341

[10]

Hrabia C.-E., 2017, COMBINING SELF ORG D, P385

← 1 2 3 →