Semantic Policy Network for Zero-Shot Object Goal Visual Navigation

被引:3
|
作者
Zhao, Qianfan [1 ,2 ]
Zhang, Lu [1 ,2 ]
He, Bin [3 ]
Liu, Zhiyong [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodel Artificial Intelligence S, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100190, Peoples R China
[3] Tongji Univ, Coll Elect & Informat Engn, Shanghai 200070, Peoples R China
关键词
Deep learning; path planning; reinforcement learning; vision-based navigation;
D O I
10.1109/LRA.2023.3320014
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
The task of zero-shot object goal visual navigation (ZSON) aims to enable robots to locate previously "unseen" objects by visual observations. This task presents a significant challenge since the robot must transfer the navigation policy learned from "seen" objects to "unseen" objects through auxiliary semantic information without training samples, a process known as zero-shot learning. In order to address this challenge, we propose a novel approach termed the Semantic Policy Network (SPNet). The SPNet consists of two modules that are deeply integrated with semantic embeddings: the Semantic Actor Policy (SAP) module and the Semantic Trajectory (ST) module. The SAP module generates actor network weight bias based on semantic embeddings, creating unique navigation policies for different target classes. The ST module records the robot's actions, visual features, and semantic embeddings at each step, and aggregates information in both the spatial and temporal dimensions. To evaluate our approach, we conducted extensive experiments using MP3D dataset, HM3D dataset, and RoboTHOR. Experimental results indicate that the proposed method outperforms other ZSON methods for both seen and unseen target classes.
引用
收藏
页码:7655 / 7662
页数:8
相关论文
共 50 条
  • [1] Zero-Shot Object Goal Visual Navigation
    Zhao, Qianfan
    Zhang, Lu
    He, Bin
    Qiao, Hong
    Liu, Zhiyong
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 2025 - 2031
  • [2] TDANet: Target-Directed Attention Network for Object-Goal Visual Navigation With Zero-Shot Ability
    Lian, Shiwei
    Zhang, Feitian
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (09): : 8075 - 8082
  • [3] ChatNav: Leveraging LLM to Zero-Shot Semantic Reasoning in Object Navigation
    Zhu, Yong
    Wen, Zhenyu
    Li, Xiong
    Shi, Xiufang
    Wu, Xiang
    Dong, Hui
    Chen, Jiming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2369 - 2381
  • [4] Zero-shot object detection with contrastive semantic association network
    Haohe Li
    Chong Wang
    Weijie Liu
    Yilin Gong
    Xinmiao Dai
    Applied Intelligence, 2023, 53 : 30056 - 30068
  • [5] Zero-shot object detection with contrastive semantic association network
    Li, Haohe
    Wang, Chong
    Liu, Weijie
    Gong, Yilin
    Dai, Xinmiao
    APPLIED INTELLIGENCE, 2023, 53 (24) : 30056 - 30068
  • [6] Improved Visual-Semantic Alignment for Zero-Shot Object Detection
    Rahman, Shafin
    Khan, Salman
    Barnes, Nick
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11932 - 11939
  • [7] Visual-Semantic Aligned Bidirectional Network for Zero-Shot Learning
    Gao, Rui
    Hou, Xingsong
    Qin, Jie
    Shen, Yuming
    Long, Yang
    Liu, Li
    Zhang, Zhao
    Shao, Ling
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1649 - 1664
  • [8] Semantic-Visual Combination Propagation Network for Zero-Shot Learning
    Song, Wenli
    Zhang, Lei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (04) : 2341 - 2345
  • [9] Prioritized Semantic Learning for Zero-Shot Instance Navigation
    Sun, Xinyu
    Liu, Lizhao
    Zhi, Hongyan
    Qiu, Ronghe
    Liang, Junwei
    COMPUTER VISION - ECCV 2024, PT XII, 2025, 15070 : 161 - 178
  • [10] Semantic-Visual Consistency Constraint Network for Zero-Shot Image Semantic Segmentation
    Chen, Qiong
    Feng, Yuan
    Li, Zhiqun
    Yang, Yong
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2024, 52 (10): : 41 - 50