Detect, Understand, Act: A Neuro-symbolic Hierarchical Reinforcement Learning Framework

被引：5

作者：

Mitchener, Ludovico ^{[1
]}

Tuckey, David ^{[1
]}

Crosby, Matthew ^{[1
]}

Russo, Alessandra ^{[1
]}

机构：

[1] Imperial Coll London, Exhibit Rd, London SW7 2BX, England

来源：

MACHINE LEARNING | 2022年 / 111卷 / 04期

关键词：

Neuro-symbolic; Hierarchical reinforcement learning; Deep reinforcement learning; Inductive logic programming; Answer set programming;

D O I：

10.1007/s10994-022-06142-7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we introduce Detect, Understand, Act (DUA), a neuro-symbolic reinforcement learning framework. The Detect component is composed of a traditional computer vision object detector and tracker. The Act component houses a set of options, high-level actions enacted by pre-trained deep reinforcement learning (DRL) policies. The Understand component provides a novel answer set programming (ASP) paradigm for symbolically implementing a meta-policy over options and effectively learning it using inductive logic programming (ILP). We evaluate our framework on the Animal-AI (AAI) competition testbed, a set of physical cognitive reasoning problems. Given a set of pre-trained DRL policies, DUA requires only a few examples to learn a meta-policy that allows it to improve the state-of-the-art on multiple of the most challenging categories from the testbed. DUA constitutes the first holistic hybrid integration of computer vision, ILP and DRL applied to an AAI-like environment and sets the foundations for further use of ILP in complex DRL challenges.

引用

页码：1523 / 1549

页数：27

共 50 条

[1] Anderson G, 2020, ADV NEUR IN, V33
[2] Andreas J, 2017, PR MACH LEARN RES, V70
[3] [Anonymous], 2018, arXiv preprint arXiv:1810.09202
[4] Berner C., 2019, ARXIV
[5] Booch G., 2020, Thinking fast and slow in ai
[6] Combining Deep Reinforcement Learning with Prior Knowledge and Reasoning
Bougie, Nicolas
Cheng, Li Kai
Ichise, Ryutaro
[J]. APPLIED COMPUTING REVIEW, 2018, 18 (02): : 33 - 45
[7] ASP-Core-2 Input Language Format
Calimeri, Francesco
Faber, Wolfgang
Gebser, Martin
Ianni, Giovambattista
Kaminski, Roland
Krennwallner, Thomas
Leone, Nicola
Maratea, Marco
Ricca, Francesco
Schaub, Torsten
[J]. THEORY AND PRACTICE OF LOGIC PROGRAMMING, 2020, 20 (02) : 294 - 309
[8] Clark KL., 1987, READINGS NONMONOTONI, P311
[9] Clark P., 2019, F A NY REGENTS SCI E
[10] Cranmer Miles D, 2019, Learning symbolic physics with graph networks

← 1 2 3 4 5 →