Detect, Understand, Act: A Neuro-symbolic Hierarchical Reinforcement Learning Framework

被引:5
作者
Mitchener, Ludovico [1 ]
Tuckey, David [1 ]
Crosby, Matthew [1 ]
Russo, Alessandra [1 ]
机构
[1] Imperial Coll London, Exhibit Rd, London SW7 2BX, England
关键词
Neuro-symbolic; Hierarchical reinforcement learning; Deep reinforcement learning; Inductive logic programming; Answer set programming;
D O I
10.1007/s10994-022-06142-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we introduce Detect, Understand, Act (DUA), a neuro-symbolic reinforcement learning framework. The Detect component is composed of a traditional computer vision object detector and tracker. The Act component houses a set of options, high-level actions enacted by pre-trained deep reinforcement learning (DRL) policies. The Understand component provides a novel answer set programming (ASP) paradigm for symbolically implementing a meta-policy over options and effectively learning it using inductive logic programming (ILP). We evaluate our framework on the Animal-AI (AAI) competition testbed, a set of physical cognitive reasoning problems. Given a set of pre-trained DRL policies, DUA requires only a few examples to learn a meta-policy that allows it to improve the state-of-the-art on multiple of the most challenging categories from the testbed. DUA constitutes the first holistic hybrid integration of computer vision, ILP and DRL applied to an AAI-like environment and sets the foundations for further use of ILP in complex DRL challenges.
引用
收藏
页码:1523 / 1549
页数:27
相关论文
共 50 条
  • [1] Anderson G, 2020, ADV NEUR IN, V33
  • [2] Andreas J, 2017, PR MACH LEARN RES, V70
  • [3] [Anonymous], 2018, arXiv preprint arXiv:1810.09202
  • [4] Berner C., 2019, ARXIV
  • [5] Booch G., 2020, Thinking fast and slow in ai
  • [6] Combining Deep Reinforcement Learning with Prior Knowledge and Reasoning
    Bougie, Nicolas
    Cheng, Li Kai
    Ichise, Ryutaro
    [J]. APPLIED COMPUTING REVIEW, 2018, 18 (02): : 33 - 45
  • [7] ASP-Core-2 Input Language Format
    Calimeri, Francesco
    Faber, Wolfgang
    Gebser, Martin
    Ianni, Giovambattista
    Kaminski, Roland
    Krennwallner, Thomas
    Leone, Nicola
    Maratea, Marco
    Ricca, Francesco
    Schaub, Torsten
    [J]. THEORY AND PRACTICE OF LOGIC PROGRAMMING, 2020, 20 (02) : 294 - 309
  • [8] Clark KL., 1987, READINGS NONMONOTONI, P311
  • [9] Clark P., 2019, F A NY REGENTS SCI E
  • [10] Cranmer Miles D, 2019, Learning symbolic physics with graph networks