Online Learning of Reusable Abstract Models for Object Goal Navigation

被引:11
作者
Campari, Tommaso [1 ,2 ]
Lamanna, Leonardo [2 ,3 ]
Traverso, Paolo [2 ]
Serafini, Luciano [2 ]
Ballan, Lamberto [1 ]
机构
[1] Univ Padua, Padua, Italy
[2] Fdn Bruno Kessler FBK, Trento, Italy
[3] Univ Brescia, Brescia, Italy
来源
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年
关键词
D O I
10.1109/CVPR52688.2022.01445
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a novel approach to incrementally learn an Abstract Model of an unknown environment, and show how an agent can reuse the learned model for tackling the Object Goal Navigation task. The Abstract Model is a finite state machine in which each state is an abstraction of a state of the environment, as perceived by the agent in a certain position and orientation. The perceptions are high-dimensional sensory data (e.g., RGB-D images), and the abstraction is reached by exploiting image segmentation and the Taskonomy model bank. The learning of the Abstract Model is accomplished by executing actions, observing the reached state, and updating the Abstract Model with the acquired information. The learned models are memorized by the agent, and they are reused whenever it recognizes to be in an environment that corresponds to the stored model. We investigate the effectiveness of the proposed approach for the Object Goal Navigation task, relying on public benchmarks. Our results show that the reuse of learned Abstract Models can boost performance on Object Goal Navigation.
引用
收藏
页码:14850 / 14859
页数:10
相关论文
共 48 条
  • [1] Learning action models with minimal observability
    Aineto, Diego
    Jimenez Celorrio, Sergio
    Onaindia, Eva
    [J]. ARTIFICIAL INTELLIGENCE, 2019, 275 : 104 - 137
  • [2] Anderson P, 2018, Arxiv, DOI [arXiv:1807.06757, 10.48550/ARXIV.1807.06757]
  • [3] Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments
    Anderson, Peter
    Wu, Qi
    Teney, Damien
    Bruce, Jake
    Johnson, Mark
    Sunderhauf, Niko
    Reid, Ian
    Gould, Stephen
    van den Hengel, Anton
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3674 - 3683
  • [4] Asai M, 2018, AAAI CONF ARTIF INTE, P6094
  • [5] Asai Masataro., 2019, PROC INT C AUTOMATED, V29, P592
  • [6] Batra D, 2020, Arxiv, DOI [arXiv:2006.13171, DOI 10.48550/ARXIV.2006.13171, 10.48550/arXiv.2006.13171]
  • [7] Batra D, 2020, Arxiv, DOI arXiv:2011.01975
  • [8] Learning First-Order Symbolic Representations for Planning from the Structure of the State Space
    Bonet, Blai
    Geffner, Hector
    [J]. ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2322 - 2329
  • [9] Campari Tommaso, 2020, PROC EUROPEAN C COMP
  • [10] Cartillier Vincent, 2021, PROC AAAI C ARTIFICI