Semantic Policy Network for Zero-Shot Object Goal Visual Navigation

被引：3

作者：

Zhao, Qianfan ^{[1
,2
]}

Zhang, Lu ^{[1
,2
]}

He, Bin ^{[3
]}

Liu, Zhiyong ^{[1
,2
]}

机构：

[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodel Artificial Intelligence S, Beijing 100190, Peoples R China

[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100190, Peoples R China

[3] Tongji Univ, Coll Elect & Informat Engn, Shanghai 200070, Peoples R China

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2023年 / 8卷 / 11期

关键词：

Deep learning; path planning; reinforcement learning; vision-based navigation;

D O I：

10.1109/LRA.2023.3320014

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

The task of zero-shot object goal visual navigation (ZSON) aims to enable robots to locate previously "unseen" objects by visual observations. This task presents a significant challenge since the robot must transfer the navigation policy learned from "seen" objects to "unseen" objects through auxiliary semantic information without training samples, a process known as zero-shot learning. In order to address this challenge, we propose a novel approach termed the Semantic Policy Network (SPNet). The SPNet consists of two modules that are deeply integrated with semantic embeddings: the Semantic Actor Policy (SAP) module and the Semantic Trajectory (ST) module. The SAP module generates actor network weight bias based on semantic embeddings, creating unique navigation policies for different target classes. The ST module records the robot's actions, visual features, and semantic embeddings at each step, and aggregates information in both the spatial and temporal dimensions. To evaluate our approach, we conducted extensive experiments using MP3D dataset, HM3D dataset, and RoboTHOR. Experimental results indicate that the proposed method outperforms other ZSON methods for both seen and unseen target classes.

引用

页码：7655 / 7662

页数：8

共 50 条

[1] Zero-Shot Object Goal Visual Navigation
Zhao, Qianfan
Zhang, Lu
He, Bin
Qiao, Hong
Liu, Zhiyong
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 2025 - 2031
[2] TDANet: Target-Directed Attention Network for Object-Goal Visual Navigation With Zero-Shot Ability
Lian, Shiwei
Zhang, Feitian
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (09): : 8075 - 8082
[3] ChatNav: Leveraging LLM to Zero-Shot Semantic Reasoning in Object Navigation
Zhu, Yong
Wen, Zhenyu
Li, Xiong
Shi, Xiufang
Wu, Xiang
Dong, Hui
Chen, Jiming
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2369 - 2381
[4] Zero-shot object detection with contrastive semantic association network
Haohe Li
Chong Wang
Weijie Liu
Yilin Gong
Xinmiao Dai
Applied Intelligence, 2023, 53 : 30056 - 30068
[5] Zero-shot object detection with contrastive semantic association network
Li, Haohe
Wang, Chong
Liu, Weijie
Gong, Yilin
Dai, Xinmiao
APPLIED INTELLIGENCE, 2023, 53 (24) : 30056 - 30068
[6] Improved Visual-Semantic Alignment for Zero-Shot Object Detection
Rahman, Shafin
Khan, Salman
Barnes, Nick
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11932 - 11939
[7] Visual-Semantic Aligned Bidirectional Network for Zero-Shot Learning
Gao, Rui
Hou, Xingsong
Qin, Jie
Shen, Yuming
Long, Yang
Liu, Li
Zhang, Zhao
Shao, Ling
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1649 - 1664
[8] Semantic-Visual Combination Propagation Network for Zero-Shot Learning
Song, Wenli
Zhang, Lei
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (04) : 2341 - 2345
[9] Prioritized Semantic Learning for Zero-Shot Instance Navigation
Sun, Xinyu
Liu, Lizhao
Zhi, Hongyan
Qiu, Ronghe
Liang, Junwei
COMPUTER VISION - ECCV 2024, PT XII, 2025, 15070 : 161 - 178
[10] Semantic-Visual Consistency Constraint Network for Zero-Shot Image Semantic Segmentation
Chen, Qiong
Feng, Yuan
Li, Zhiqun
Yang, Yong
Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2024, 52 (10): : 41 - 50

← 1 2 3 4 5 →