Semantic Policy Network for Zero-Shot Object Goal Visual Navigation

被引:3
|
作者
Zhao, Qianfan [1 ,2 ]
Zhang, Lu [1 ,2 ]
He, Bin [3 ]
Liu, Zhiyong [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodel Artificial Intelligence S, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100190, Peoples R China
[3] Tongji Univ, Coll Elect & Informat Engn, Shanghai 200070, Peoples R China
关键词
Deep learning; path planning; reinforcement learning; vision-based navigation;
D O I
10.1109/LRA.2023.3320014
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
The task of zero-shot object goal visual navigation (ZSON) aims to enable robots to locate previously "unseen" objects by visual observations. This task presents a significant challenge since the robot must transfer the navigation policy learned from "seen" objects to "unseen" objects through auxiliary semantic information without training samples, a process known as zero-shot learning. In order to address this challenge, we propose a novel approach termed the Semantic Policy Network (SPNet). The SPNet consists of two modules that are deeply integrated with semantic embeddings: the Semantic Actor Policy (SAP) module and the Semantic Trajectory (ST) module. The SAP module generates actor network weight bias based on semantic embeddings, creating unique navigation policies for different target classes. The ST module records the robot's actions, visual features, and semantic embeddings at each step, and aggregates information in both the spatial and temporal dimensions. To evaluate our approach, we conducted extensive experiments using MP3D dataset, HM3D dataset, and RoboTHOR. Experimental results indicate that the proposed method outperforms other ZSON methods for both seen and unseen target classes.
引用
收藏
页码:7655 / 7662
页数:8
相关论文
共 50 条
  • [21] Zero-Shot Object Recognition Using Semantic Label Vectors
    Naha, Shujon
    Wang, Yang
    2015 12TH CONFERENCE ON COMPUTER AND ROBOT VISION CRV 2015, 2015, : 94 - 100
  • [22] Zero-shot Object Prediction using Semantic Scene Knowledge
    Grzeszick, Rene
    Fink, Gernot A.
    PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2017), VOL 5, 2017, : 120 - 129
  • [23] A dynamic semantic knowledge graph for zero-shot object detection
    Wen Lv
    Hongbo Shi
    Shuai Tan
    Bing Song
    Yang Tao
    The Visual Computer, 2023, 39 : 4513 - 4527
  • [24] A dynamic semantic knowledge graph for zero-shot object detection
    Lv, Wen
    Shi, Hongbo
    Tan, Shuai
    Song, Bing
    Tao, Yang
    VISUAL COMPUTER, 2023, 39 (10): : 4513 - 4527
  • [25] Learning Latent Semantic Attributes for Zero-Shot Object Detection
    Wang, Kang
    Zhang, Lu
    Tan, Yifan
    Zhao, Jiajia
    Zhou, Shuigeng
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 230 - 237
  • [26] Adaptive adjustment with semantic embedding for zero-shot object detection
    Lv, Wen
    Shi, Hongbo
    Tan, Shuai
    Song, Bing
    Tao, Yang
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (03)
  • [27] Attentive Semantic Preservation Network for Zero-Shot Learning
    Lu, Ziqian
    Yu, Yunlong
    Lu, Zhe-Ming
    Shen, Feng-Li
    Zhang, Zhongfei
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 2919 - 2925
  • [28] Zero-shot Object Detection Based on Dynamic Semantic Vectors
    Li, Haoyu
    Mei, Jilin
    Zhou, Jiancong
    Hu, Yu
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9267 - 9273
  • [29] Semantic combined network for zero-shot scene parsing
    Wang, Yinduo
    Zhang, Haofeng
    Wang, Shidong
    Long, Yang
    Yang, Longzhi
    IET IMAGE PROCESSING, 2020, 14 (04) : 757 - 765
  • [30] Zero-shot Semantic Segmentation Using Relation Network
    Zhang, Yindong
    Khriyenko, Oleksiy
    PROCEEDINGS OF THE 28TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION FRUCT, 2021, : 516 - 527