A synthesis of automated planning and reinforcement learning for efficient, robust decision-making

Cited by: 63
Authors
Leonetti, Matteo [1, 3]
Iocchi, Luca [2]
Stone, Peter [1]
Affiliations
[1] Univ Texas Austin, Dept Comp Sci, 2317 Speedway,Stop D9500, Austin, TX 78712 USA
[2] Sapienza Univ Rome, Dept Comp Control & Management Engn, Via Ariosto 25, I-00185 Rome, Italy
[3] Univ Leeds, Sch Comp, Leeds LS2 9JT, W Yorkshire, England
Funding
National Science Foundation (USA)
Keywords
Automated planning; Reinforcement learning; Autonomous robot; Robot learning; Answer set programming
DOI
10.1016/j.artint.2016.07.004
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Automated planning and reinforcement learning are characterized by complementary views on decision making: the former relies on previous knowledge and computation, while the latter relies on interaction with the world and on experience. Planning allows robots to carry out different tasks in the same domain without the need to acquire knowledge about each one of them, but relies strongly on the accuracy of the model. Reinforcement learning, on the other hand, does not require previous knowledge and allows robots to robustly adapt to the environment, but often necessitates an infeasible amount of experience. We present Domain Approximation for Reinforcement LearniNG (DARLING), a method that takes advantage of planning to constrain the behavior of the agent to reasonable choices, and of reinforcement learning to adapt to the environment and increase the reliability of the decision-making process. We demonstrate the effectiveness of the proposed method on a service robot carrying out a variety of tasks in an office building. We find that when the robot makes decisions by planning alone on a given model it often fails, and when it makes decisions by reinforcement learning alone it often cannot complete its tasks in a reasonable amount of time. When employing DARLING, however, even when seeded with the same model that was used for planning alone, the robot can quickly learn a behavior to carry out all the tasks, improve over time, and adapt to the environment as it changes. © 2016 Elsevier B.V. All rights reserved.
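The idea described in the abstract, using a planner to restrict the agent to reasonable choices and then learning within that restriction, can be illustrated with a toy sketch. This is not the authors' implementation: the room graph `MODEL`, the `slack` parameter, the hidden cost table `TRUE_COST`, and all function names are hypothetical, and a plain path enumeration stands in for a real planner. The sketch keeps only actions that appear in some near-shortest plan, then runs tabular Q-learning over that reduced action set against costs the model does not know about.

```python
import random
from collections import defaultdict

# Hypothetical planning model: rooms and the moves believed possible.
MODEL = {
    "A": ["B", "C", "D"],
    "B": ["G"],
    "C": ["E"],
    "D": ["A"],  # a detour that never reaches the goal without revisiting A
    "E": ["G"],
    "G": [],
}

# Hypothetical real-world cost the model ignores: A->B is often blocked.
TRUE_COST = {("A", "B"): 10.0}


def plans_up_to(model, start, goal, max_len):
    """Enumerate simple paths from start to goal with at most max_len moves."""
    found, stack = [], [[start]]
    while stack:
        path = stack.pop()
        if path[-1] == goal:
            found.append(path)
        elif len(path) - 1 < max_len:
            for nxt in model[path[-1]]:
                if nxt not in path:
                    stack.append(path + [nxt])
    return found


def allowed_actions(model, start, goal, slack=1):
    """Keep moves used by any plan at most `slack` steps longer than the shortest."""
    plans = plans_up_to(model, start, goal, len(model))
    shortest = min(len(p) - 1 for p in plans)
    allowed = defaultdict(set)
    for p in plans:
        if len(p) - 1 <= shortest + slack:
            for s, nxt in zip(p, p[1:]):
                allowed[s].add(nxt)
    return allowed


def q_learn(allowed, start, goal, episodes=200, alpha=0.5, epsilon=0.2, seed=0):
    """Tabular Q-learning (undiscounted) restricted to the allowed moves."""
    rng = random.Random(seed)
    q = defaultdict(float)
    for _ in range(episodes):
        s = start
        while s != goal:
            actions = sorted(allowed[s])
            if rng.random() < epsilon:
                a = rng.choice(actions)
            else:
                a = max(actions, key=lambda x: q[(s, x)])
            reward = -TRUE_COST.get((s, a), 1.0)
            future = max((q[(a, n)] for n in allowed[a]), default=0.0)
            q[(s, a)] += alpha * (reward + future - q[(s, a)])
            s = a  # in this toy world the move chosen is the next room
    return q


def greedy_path(q, allowed, start, goal):
    """Follow the highest-valued allowed move from start until the goal."""
    path = [start]
    while path[-1] != goal:
        s = path[-1]
        path.append(max(sorted(allowed[s]), key=lambda x: q[(s, x)]))
    return path


if __name__ == "__main__":
    allowed = allowed_actions(MODEL, "A", "G", slack=1)
    q = q_learn(allowed, "A", "G")
    print(greedy_path(q, allowed, "A", "G"))
```

In this sketch the planner alone would prefer the two-step route through B, which the hidden cost makes expensive, while unrestricted learning would also have to explore the useless move to D. Restricting learning to the near-shortest plans removes D from consideration and still leaves the longer route through C and E available for the learner to discover.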
Pages: 103-130
Number of pages: 28