Online Learning of Reusable Abstract Models for Object Goal Navigation

被引：11

作者：

Campari, Tommaso ^{[1
,2
]}

Lamanna, Leonardo ^{[2
,3
]}

Traverso, Paolo ^{[2
]}

Serafini, Luciano ^{[2
]}

Ballan, Lamberto ^{[1
]}

机构：

[1] Univ Padua, Padua, Italy

[2] Fdn Bruno Kessler FBK, Trento, Italy

[3] Univ Brescia, Brescia, Italy

来源：

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年

关键词：

D O I：

10.1109/CVPR52688.2022.01445

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present a novel approach to incrementally learn an Abstract Model of an unknown environment, and show how an agent can reuse the learned model for tackling the Object Goal Navigation task. The Abstract Model is a finite state machine in which each state is an abstraction of a state of the environment, as perceived by the agent in a certain position and orientation. The perceptions are high-dimensional sensory data (e.g., RGB-D images), and the abstraction is reached by exploiting image segmentation and the Taskonomy model bank. The learning of the Abstract Model is accomplished by executing actions, observing the reached state, and updating the Abstract Model with the acquired information. The learned models are memorized by the agent, and they are reused whenever it recognizes to be in an environment that corresponds to the stored model. We investigate the effectiveness of the proposed approach for the Object Goal Navigation task, relying on public benchmarks. Our results show that the reuse of learned Abstract Models can boost performance on Object Goal Navigation.

引用

页码：14850 / 14859

页数：10

共 48 条

[1] Learning action models with minimal observability
Aineto, Diego
Jimenez Celorrio, Sergio
Onaindia, Eva
[J]. ARTIFICIAL INTELLIGENCE, 2019, 275 : 104 - 137
[2] Anderson P, 2018, Arxiv, DOI [arXiv:1807.06757, 10.48550/ARXIV.1807.06757]
[3] Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments
Anderson, Peter
Wu, Qi
Teney, Damien
Bruce, Jake
Johnson, Mark
Sunderhauf, Niko
Reid, Ian
Gould, Stephen
van den Hengel, Anton
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3674 - 3683
[4] Asai M, 2018, AAAI CONF ARTIF INTE, P6094
[5] Asai Masataro., 2019, PROC INT C AUTOMATED, V29, P592
[6] Batra D, 2020, Arxiv, DOI [arXiv:2006.13171, DOI 10.48550/ARXIV.2006.13171, 10.48550/arXiv.2006.13171]
[7] Batra D, 2020, Arxiv, DOI arXiv:2011.01975
[8] Learning First-Order Symbolic Representations for Planning from the Structure of the State Space
Bonet, Blai
Geffner, Hector
[J]. ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2322 - 2329
[9] Campari Tommaso, 2020, PROC EUROPEAN C COMP
[10] Cartillier Vincent, 2021, PROC AAAI C ARTIFICI

← 1 2 3 4 5 →