A synthesis of automated planning and reinforcement learning for efficient, robust decision-making

被引:63
|
作者
Leonetti, Matteo [1 ,3 ]
Iocchi, Luca [2 ]
Stone, Peter [1 ]
机构
[1] Univ Texas Austin, Dept Comp Sci, 2317 Speedway,Stop D9500, Austin, TX 78712 USA
[2] Sapienza Univ Rome, Dept Comp Control & Management Engn, Via Ariosto 25, I-00185 Rome, Italy
[3] Univ Leeds, Sch Comp, Leeds LS2 9JT, W Yorkshire, England
基金
美国国家科学基金会;
关键词
Automated planning; Reinforcement learning; Autonomous robot; Robot learning; Answer set programming;
D O I
10.1016/j.artint.2016.07.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automated planning and reinforcement learning are characterized by complementary views on decision making: the former relies on previous knowledge and computation, while the latter on interaction with the world, and experience. Planning allows robots to carry out different tasks in the same domain, without the need to acquire knowledge about each one of them, but relies strongly on the accuracy of the model. Reinforcement learning, on the other hand, does not require previous knowledge, and allows robots to robustly adapt to the environment, but often necessitates an infeasible amount of experience. We present Domain Approximation for Reinforcement LearniNG (DARLING), a method that takes advantage of planning to constrain the behavior of the agent to reasonable choices, and of reinforcement learning to adapt to the environment, and increase the reliability of the decision making process. We demonstrate the effectiveness of the proposed method on a service robot, carrying out a variety of tasks in an office building. We find that when the robot makes decisions by planning alone on a given model it often fails, and when it makes decisions by reinforcement learning alone it often cannot complete its tasks in a reasonable amount of time. When employing DARLING, even when seeded with the same model that was used for planning alone, however, the robot can quickly learn a behavior to carry out all the tasks, improves over time, and adapts to, the environment as it changes. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:103 / 130
页数:28
相关论文
共 50 条
  • [21] A Comprehensive Driving Decision-Making Methodology Based on Deep Reinforcement Learning for Automated Commercial Vehicles
    Hu, Weiming
    Li, Xu
    Hu, Jinchao
    Liu, Yan
    Zhou, Jinying
    INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2024, 25 (06) : 1469 - 1483
  • [22] Decision-Making System for Lane Change Using Deep Reinforcement Learning in Connected and Automated Driving
    An, HongIl
    Jung, Jae-il
    ELECTRONICS, 2019, 8 (05):
  • [23] Automated decision-making
    Ivanov, Stanislav Hristov
    FORESIGHT, 2023, 25 (01): : 4 - 19
  • [24] Toward Trustworthy Decision-Making for Autonomous Vehicles: A Robust Reinforcement Learning Approach with Safety Guarantees
    He, Xiangkun
    Huang, Wenhui
    Lv, Chen
    ENGINEERING, 2024, 33 : 77 - 89
  • [25] A Decision-Making Framework for Load Rating Planning of Aging Bridges Using Deep Reinforcement Learning
    Cheng, Minghui
    Frangopol, Dan M.
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2021, 35 (06)
  • [26] Augmented Reinforcement Learning with Efficient Social-based Motion Prediction for Autonomous Decision-Making
    Gutierrez-Moreno, Rodrigo
    Gomez-Huelamo, Carlos
    Barea, Rafael
    Lopez-Guillen, Elena
    Arango, Felipe
    Bergasa, Luis M.
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 4167 - 4172
  • [27] Adaptive Decision-Making for Automated Vehicles Under Roundabout Scenarios Using Optimization Embedded Reinforcement Learning
    Zhang, Yuxiang
    Gao, Bingzhao
    Guo, Lulu
    Guo, Hongyan
    Chen, Hong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (12) : 5526 - 5538
  • [28] Decision-making for Connected and Automated Vehicles in Chanllenging Traffic Conditions Using Imitation and Deep Reinforcement Learning
    Hu, Jinchao
    Li, Xu
    Hu, Weiming
    Xu, Qimin
    Hu, Yue
    INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2023, 24 (06) : 1589 - 1602
  • [29] Decision-making for Connected and Automated Vehicles in Chanllenging Traffic Conditions Using Imitation and Deep Reinforcement Learning
    Jinchao Hu
    Xu Li
    Weiming Hu
    Qimin Xu
    Yue Hu
    International Journal of Automotive Technology, 2023, 24 : 1589 - 1602
  • [30] Negotiable Reinforcement Learning for Pareto Optimal Sequential Decision-Making
    Desai, Nishant
    Critch, Andrew
    Russell, Stuart
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31