Symmetry-aware Neural Architecture for Embodied Visual Navigation

被引:0
|
作者
Liu, Shuang [1 ]
Suganuma, Masanori [2 ]
Okatani, Takayuki [1 ,2 ]
机构
[1] RIKEN, Ctr AIP, Tokyo 1030027, Japan
[2] Tohoku Univ, GSIS, Sendai, Miyagi 9808579, Japan
关键词
Embodied visual navigation; Symmetry; Generalization; Equivariance; Invariance;
D O I
10.1007/s11263-023-01909-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The existing methods for addressing visual navigation employ deep reinforcement learning as the standard tool for the task. However, they tend to be vulnerable to statistical shifts between the training and test data, resulting in poor generalization over novel environments that are out-of-distribution from the training data. In this study, we attempt to improve the generalization ability by utilizing the inductive biases available for the task. Employing the active neural SLAM that learns policies with the advantage actor-critic method as the base framework, we first point out that the mappings represented by the actor and the critic should satisfy specific symmetries. We then propose a network design for the actor and the critic to inherently attain these symmetries. Specifically, we use G-convolution instead of the standard convolution and insert the semi-global polar pooling layer, which we newly design in this study, in the last section of the critic network. Our method can be integrated into existing methods that utilize intermediate goals and 2D occupancy maps. Experimental results show that our method improves generalization ability by a good margin over visual exploration and object goal navigation, which are two main embodied visual navigation tasks.
引用
收藏
页码:1091 / 1107
页数:17
相关论文
共 50 条
  • [31] Learning symmetry-aware atom mapping in chemical reactions through deep graph matching
    Astero, Maryam
    Rousu, Juho
    JOURNAL OF CHEMINFORMATICS, 2024, 16 (01)
  • [32] Last-Mile Embodied Visual Navigation
    Wasserman, Justin
    Yadav, Karmesh
    Chowdhary, Girish
    Gupta, Abhinav
    Jain, Unnat
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 666 - 678
  • [33] Echo-Enhanced Embodied Visual Navigation
    Yu, Yinfeng
    Cao, Lele
    Sun, Fuchun
    Yang, Chao
    Lai, Huicheng
    Huang, Wenbing
    NEURAL COMPUTATION, 2023, 35 (05) : 958 - 976
  • [34] Auto-Navigator: Decoupled Neural Architecture Search for Visual Navigation
    Tang, Tianqi
    Yu, Xin
    Dong, Xuanyi
    Yang, Yi
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3742 - 3751
  • [35] Symmetry-aware kinematic skeleton generation of a 3D human body model
    Luo, Shan
    Feng, Jieqing
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (29-30) : 20579 - 20602
  • [36] SYM-FISH: A Symmetry-aware Flip Invariant Sketch Histogram Shape Descriptor
    Cao, Xiaochun
    Zhang, Hua
    Liu, Si
    Guo, Xiaojie
    Lin, Liang
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 313 - 320
  • [37] Symmetry-aware kinematic skeleton generation of a 3D human body model
    Shan Luo
    Jieqing Feng
    Multimedia Tools and Applications, 2020, 79 : 20579 - 20602
  • [38] Learning Symmetry-Aware Geometry Correspondences for 6D Object Pose Estimation
    Zhao, Heng
    Wei, Shenxing
    Shi, Dahu
    Tan, Wenming
    Li, Zheyang
    Ren, Ye
    Wei, Xing
    Yang, Yi
    Pu, Shiliang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 13999 - 14008
  • [39] PCBSSD: Self-supervised symmetry-aware detector for PCB displacement and orientation inspection
    Li, Jingxuan
    Da, Feipeng
    Yu, Yi
    MEASUREMENT, 2025, 243
  • [40] Neural Network Architecture of Embodied Intelligence
    A. R. Nurutdinov
    Automatic Documentation and Mathematical Linguistics, 2024, 58 (Suppl 4) : S241 - S264