A synthesis of automated planning and reinforcement learning for efficient, robust decision-making

被引：63

作者：

Leonetti, Matteo ^{[1
,3
]}

Iocchi, Luca ^{[2
]}

Stone, Peter ^{[1
]}

机构：

[1] Univ Texas Austin, Dept Comp Sci, 2317 Speedway,Stop D9500, Austin, TX 78712 USA

[2] Sapienza Univ Rome, Dept Comp Control & Management Engn, Via Ariosto 25, I-00185 Rome, Italy

[3] Univ Leeds, Sch Comp, Leeds LS2 9JT, W Yorkshire, England

来源：

ARTIFICIAL INTELLIGENCE | 2016年 / 241卷

基金：

美国国家科学基金会;

关键词：

Automated planning; Reinforcement learning; Autonomous robot; Robot learning; Answer set programming;

D O I：

10.1016/j.artint.2016.07.004

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Automated planning and reinforcement learning are characterized by complementary views on decision making: the former relies on previous knowledge and computation, while the latter on interaction with the world, and experience. Planning allows robots to carry out different tasks in the same domain, without the need to acquire knowledge about each one of them, but relies strongly on the accuracy of the model. Reinforcement learning, on the other hand, does not require previous knowledge, and allows robots to robustly adapt to the environment, but often necessitates an infeasible amount of experience. We present Domain Approximation for Reinforcement LearniNG (DARLING), a method that takes advantage of planning to constrain the behavior of the agent to reasonable choices, and of reinforcement learning to adapt to the environment, and increase the reliability of the decision making process. We demonstrate the effectiveness of the proposed method on a service robot, carrying out a variety of tasks in an office building. We find that when the robot makes decisions by planning alone on a given model it often fails, and when it makes decisions by reinforcement learning alone it often cannot complete its tasks in a reasonable amount of time. When employing DARLING, even when seeded with the same model that was used for planning alone, however, the robot can quickly learn a behavior to carry out all the tasks, improves over time, and adapts to, the environment as it changes. (C) 2016 Elsevier B.V. All rights reserved.

引用

页码：103 / 130

页数：28

共 50 条

[1] Robust Multiagent Reinforcement Learning toward Coordinated Decision-Making of Automated Vehicles
He, Xiangkun
Chen, Hao
Lv, Chen
SAE INTERNATIONAL JOURNAL OF VEHICLE DYNAMICS STABILITY AND NVH, 2023, 7 (04): : 475 - 488
[2] PEORL: Integrating Symbolic Planning and Hierarchical Reinforcement Learning for Robust Decision-Making
Yang, Fangkai
Lyu, Daoming
Liu, Bo
Gustafson, Steven
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 4860 - 4866
[3] Reinforcement learning with hierarchical decision-making
Cohen, Shahar
Maimon, Oded
Khmlenitsky, Evgeni
ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 3, 2006, : 177 - +
[4] Deploying Reinforcement Learning for Efficient Runtime Decision-Making in Autonomous Systems
Dastranj, Melika
Nia, Mehran Alidoost
Kargahi, Mehdi
2022 CPSSI 4TH INTERNATIONAL SYMPOSIUM ON REAL-TIME AND EMBEDDED SYSTEMS AND TECHNOLOGIES (RTEST 2022), 2022,
[5] Decision analysis and reinforcement learning in surgical decision-making
Loftus, Tyler J.
Filiberto, Amanda C.
Li, Yanjun
Balch, Jeremy
Cook, Allyson C.
Tighe, Patrick J.
Efron, Philip A.
Upchurch, Gilbert R., Jr.
Rashidi, Parisa
Li, Xiaolin
Bihorac, Azra
SURGERY, 2020, 168 (02) : 253 - 266
[6] Motion Planning Among Dynamic, Decision-Making Agents with Deep Reinforcement Learning
Everett, Michael
Chen, Yu Fan
How, Jonathan P.
2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 3052 - 3059
[7] REINFORCEMENT LEARNING FOR DECISION-MAKING IN A BUSINESS SIMULATOR
Garcia, Javier
Borrajo, Fernando
Fernandez, Fernando
INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2012, 11 (05) : 935 - 960
[8] Transformer in reinforcement learning for decision-making: a survey
Yuan, Weilin
Chen, Jiaxing
Chen, Shaofei
Feng, Dawei
Hu, Zhenzhen
Li, Peng
Zhao, Weiwei
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2024, 25 (06) : 763 - 790
[9] Invited: Efficient Reinforcement Learning for Automating Human Decision-Making in SoC Design
Sadasivam, Shankar
Chen, Zhuo
Lee, Jinwon
Jain, Rajeev
2018 55TH ACM/ESDA/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2018,
[10] Towards an Efficient and Robust Maintenance Decision-making
Cherkaoui, Hajar
Khac Tuan Huynh
Grall, Antoine
2016 SECOND INTERNATIONAL SYMPOSIUM ON STOCHASTIC MODELS IN RELIABILITY ENGINEERING, LIFE SCIENCE AND OPERATIONS MANAGEMENT (SMRLO), 2016, : 225 - 232

← 1 2 3 4 5 →