Visual Navigation With Multiple Goals Based on Deep Reinforcement Learning

被引：34

作者：

Rao, Zhenhuan ^{[1
]}

Wu, Yuechen ^{[1
]}

Yang, Zifei ^{[1
]}

Zhang, Wei ^{[1
,2
]}

Lu, Shijian ^{[3
]}

Lu, Weizhi ^{[1
]}

Zha, ZhengJun ^{[4
]}

机构：

[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China

[2] Shandong Univ, Inst Brain & Brain Inspired Sci, Jinan 250061, Peoples R China

[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore

[4] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230026, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2021年 / 32卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Navigation; Task analysis; Visualization; Training; Reinforcement learning; Computer architecture; Adaptation models; Deep reinforcement learning; scene generalization; visual navigation;

D O I：

10.1109/TNNLS.2021.3057424

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Learning to adapt to a series of different goals in visual navigation is challenging. In this work, we present a model-embedded actor-critic architecture for the multigoal visual navigation task. To enhance the task cooperation in multigoal learning, we introduce two new designs to the reinforcement learning scheme: inverse dynamics model (InvDM) and multigoal colearning (MgCl). Specifically, InvDM is proposed to capture the navigation-relevant association between state and goal and provide additional training signals to relieve the sparse reward issue. MgCl aims at improving the sample efficiency and supports the agent to learn from unintentional positive experiences. Besides, to further improve the scene generalization capability of the agent, we present an enhanced navigation model that consists of two self-supervised auxiliary task modules. The first module, which is named path closed-loop detection, helps to understand whether the state has been experienced. The second one, namely the state-target matching module, tries to figure out the difference between state and goal. Extensive results on the interactive platform AI2-THOR demonstrate that the agent trained with the proposed method converges faster than state-of-the-art methods while owning good generalization capability. The video demonstration is available at https://vsislab.github.io/mgvn.

引用

页码：5445 / 5455

页数：11

共 47 条

[1] Learning to See by Moving
Agrawal, Pulkit
Carreira, Joao
Malik, Jitendra
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 37 - 45
[2] Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments
Anderson, Peter
Wu, Qi
Teney, Damien
Bruce, Jake
Johnson, Mark
Sunderhauf, Niko
Reid, Ian
Gould, Stephen
van den Hengel, Anton
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3674 - 3683
[3] Andrychowicz M., 2017, ARXIV170701495
[4] Vector-based navigation using grid-like representations in artificial agents
Banino, Andrea
Barry, Caswell
Uria, Benigno
Blundell, Charles
Lillicrap, Timothy
Mirowski, Piotr
Pritzel, Alexander
Chadwick, Martin J.
Degris, Thomas
Modayil, Joseph
Wayne, Greg
Soyer, Hubert
Viola, Fabio
Zhang, Brian
Goroshin, Ross
Rabinowitz, Neil
Pascanu, Razvan
Beattie, Charlie
Petersen, Stig
Sadik, Amir
Gaffney, Stephen
King, Helen
Kavukcuoglu, Koray
Hassabis, Demis
Hadsell, Raia
Kumaran, Dharshan
[J]. NATURE, 2018, 557 (7705) : 429 - +
[5] Visual navigation for mobile robots: A survey
Bonin-Font, Francisco
Ortiz, Alberto
Oliver, Gabriel
[J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2008, 53 (03) : 263 - 296
[6] Towards Generalization in Target-Driven Visual Navigation by Using Deep Reinforcement Learning
Devo, Alessandro
Mezzetti, Giacomo
Costante, Gabriele
Fravolini, Mario L.
Valigi, Paolo
[J]. IEEE TRANSACTIONS ON ROBOTICS, 2020, 36 (05) : 1546 - 1561
[7] Dosovitskiy A., 2016, CoRR
[8] Farebrother Jesse., 2018, CoRR
[9] Cognitive Mapping and Planning for Visual Navigation
Gupta, Saurabh
Davidson, James
Levine, Sergey
Sukthankar, Rahul
Malik, Jitendra
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 7272 - 7281
[10] Exploiting Generalization in the Subspaces for Faster Model-Based Reinforcement Learning
Hashemzadeh, Maryam
Hosseini, Reshad
Ahmadabadi, Majid Nili
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (06) : 1635 - 1650

← 1 2 3 4 5 →