Visual Navigation With Multiple Goals Based on Deep Reinforcement Learning

被引:34
作者
Rao, Zhenhuan [1 ]
Wu, Yuechen [1 ]
Yang, Zifei [1 ]
Zhang, Wei [1 ,2 ]
Lu, Shijian [3 ]
Lu, Weizhi [1 ]
Zha, ZhengJun [4 ]
机构
[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
[2] Shandong Univ, Inst Brain & Brain Inspired Sci, Jinan 250061, Peoples R China
[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[4] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230026, Peoples R China
基金
中国国家自然科学基金;
关键词
Navigation; Task analysis; Visualization; Training; Reinforcement learning; Computer architecture; Adaptation models; Deep reinforcement learning; scene generalization; visual navigation;
D O I
10.1109/TNNLS.2021.3057424
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning to adapt to a series of different goals in visual navigation is challenging. In this work, we present a model-embedded actor-critic architecture for the multigoal visual navigation task. To enhance the task cooperation in multigoal learning, we introduce two new designs to the reinforcement learning scheme: inverse dynamics model (InvDM) and multigoal colearning (MgCl). Specifically, InvDM is proposed to capture the navigation-relevant association between state and goal and provide additional training signals to relieve the sparse reward issue. MgCl aims at improving the sample efficiency and supports the agent to learn from unintentional positive experiences. Besides, to further improve the scene generalization capability of the agent, we present an enhanced navigation model that consists of two self-supervised auxiliary task modules. The first module, which is named path closed-loop detection, helps to understand whether the state has been experienced. The second one, namely the state-target matching module, tries to figure out the difference between state and goal. Extensive results on the interactive platform AI2-THOR demonstrate that the agent trained with the proposed method converges faster than state-of-the-art methods while owning good generalization capability. The video demonstration is available at https://vsislab.github.io/mgvn.
引用
收藏
页码:5445 / 5455
页数:11
相关论文
共 47 条
  • [1] Learning to See by Moving
    Agrawal, Pulkit
    Carreira, Joao
    Malik, Jitendra
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 37 - 45
  • [2] Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments
    Anderson, Peter
    Wu, Qi
    Teney, Damien
    Bruce, Jake
    Johnson, Mark
    Sunderhauf, Niko
    Reid, Ian
    Gould, Stephen
    van den Hengel, Anton
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3674 - 3683
  • [3] Andrychowicz M., 2017, ARXIV170701495
  • [4] Vector-based navigation using grid-like representations in artificial agents
    Banino, Andrea
    Barry, Caswell
    Uria, Benigno
    Blundell, Charles
    Lillicrap, Timothy
    Mirowski, Piotr
    Pritzel, Alexander
    Chadwick, Martin J.
    Degris, Thomas
    Modayil, Joseph
    Wayne, Greg
    Soyer, Hubert
    Viola, Fabio
    Zhang, Brian
    Goroshin, Ross
    Rabinowitz, Neil
    Pascanu, Razvan
    Beattie, Charlie
    Petersen, Stig
    Sadik, Amir
    Gaffney, Stephen
    King, Helen
    Kavukcuoglu, Koray
    Hassabis, Demis
    Hadsell, Raia
    Kumaran, Dharshan
    [J]. NATURE, 2018, 557 (7705) : 429 - +
  • [5] Visual navigation for mobile robots: A survey
    Bonin-Font, Francisco
    Ortiz, Alberto
    Oliver, Gabriel
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2008, 53 (03) : 263 - 296
  • [6] Towards Generalization in Target-Driven Visual Navigation by Using Deep Reinforcement Learning
    Devo, Alessandro
    Mezzetti, Giacomo
    Costante, Gabriele
    Fravolini, Mario L.
    Valigi, Paolo
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2020, 36 (05) : 1546 - 1561
  • [7] Dosovitskiy A., 2016, CoRR
  • [8] Farebrother Jesse., 2018, CoRR
  • [9] Cognitive Mapping and Planning for Visual Navigation
    Gupta, Saurabh
    Davidson, James
    Levine, Sergey
    Sukthankar, Rahul
    Malik, Jitendra
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 7272 - 7281
  • [10] Exploiting Generalization in the Subspaces for Faster Model-Based Reinforcement Learning
    Hashemzadeh, Maryam
    Hosseini, Reshad
    Ahmadabadi, Majid Nili
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (06) : 1635 - 1650