An Object-Driven Navigation Strategy Based on Active Perception and Semantic Association

被引:1
作者
Guo, Yu [1 ,2 ]
Sun, Jinsheng [1 ,2 ]
Zhang, Ruiheng [1 ,2 ]
Jiang, Zhiqi [1 ,2 ]
Mi, Zhenqiang [1 ,2 ]
Yao, Chao [1 ,2 ]
Ban, Xiaojuan [1 ,2 ,3 ]
Obaidat, Mohammad S. [4 ,5 ,6 ]
机构
[1] Univ Sci & Technol Beijing, Beijing Adv Innovat Ctr Mat Genome Engn, Sch Intelligence Sci & Technol, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[2] Univ Sci & Technol Beijing, Shunde Innovat Sch, Beijing, Peoples R China
[3] Liaoning Acad Mat, Inst Mat Intelligent Technol, Shenyang 110004, Peoples R China
[4] Univ Jordan, King AbdullahSchool Informat Technol 2, Amman 11942, Jordan
[5] Amity Univ, Sch Engn, Noida 201301, India
[6] SRM Univ, Sch Comp, Kattankulathur 603203, India
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 08期
基金
中国国家自然科学基金;
关键词
Navigation; Semantics; Task analysis; Robot kinematics; Active perception; Accuracy; Visualization; Perception-action coupling; semantic scene understanding; vision-based navigation; NETWORK;
D O I
10.1109/LRA.2024.3418269
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Efficiently navigating to a specific kind of objects in an unknown environment is an important and challenging research topic in Embodied AI. Existing methods, such as end-to-end learning ones and modular ones still struggle at this task as they have poor efficiency, interpretability and/or generalization. This letter proposes an object-driven navigation strategy for robots based on active perception and semantic association. Specifically, this strategy enables robots to explore the environment and complete object-driven navigation tasks by endowing them with the ability to explore areas, infer associations, remember scenes, continue navigation, and so on. On this basis, this letter designs a semantic association model based on a graph convolution network using rich information hidden in the environment, including geometric and semantic information, to dynamically predict the association between objects, providing more accurate and effective prior knowledge for object-driven navigation. Experimental results prove that the proposed strategy can help robots better perceive and understand the environment and it is superior to similar strategies in terms of success rate and navigation efficiency.
引用
收藏
页码:7110 / 7117
页数:8
相关论文
共 28 条
  • [1] Anderson P, 2018, Arxiv, DOI [arXiv:1807.06757, 10.48550/ARXIV.1807.06757]
  • [2] Online Learning of Reusable Abstract Models for Object Goal Navigation
    Campari, Tommaso
    Lamanna, Leonardo
    Traverso, Paolo
    Serafini, Luciano
    Ballan, Lamberto
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14850 - 14859
  • [3] Chaplot D. S., 2020, P 8 INT C LEARN REPR, P2193
  • [4] Chaplot DS, 2020, ADV NEUR IN, V33
  • [5] ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
    Dai, Angela
    Chang, Angel X.
    Savva, Manolis
    Halber, Maciej
    Funkhouser, Thomas
    Niessner, Matthias
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2432 - 2443
  • [6] Visual Object Search by Learning Spatial Context
    Druon, Raphael
    Yoshiyasu, Yusuke
    Kanezaki, Asako
    Watt, Alassane
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) : 1279 - 1286
  • [7] Navigating to objects in the real world
    Gervet, Theophile
    Chintala, Soumith
    Batra, Dhruv
    Malik, Jitendra
    Chaplot, Devendra Singh
    [J]. SCIENCE ROBOTICS, 2023, 8 (79)
  • [8] Spatial Commonsense Graph for Object Localisation in Partial Scenes
    Giuliari, Francesco
    Skenderi, Geri
    Cristani, Marco
    Wang, Yiming
    Del Bue, Alessio
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19496 - 19505
  • [9] A Review of Deep Learning-Based Visual Multi-Object Tracking Algorithms for Autonomous Driving
    Guo, Shuman
    Wang, Shichang
    Yang, Zhenzhong
    Wang, Lijun
    Zhang, Huawei
    Guo, Pengyan
    Gao, Yuguo
    Guo, Junkai
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (21):
  • [10] RIANet++: Road Graph and Image Attention Networks for Robust Urban Autonomous Driving Under Road Changes
    Ha, Taeoh
    Oh, Jeongwoo
    Lee, Gunmin
    Heo, Jaeseok
    Kim, Do Hyung
    Park, Byungkyu
    Lee, Chang-Gun
    Oh, Songhwai
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (11) : 7815 - 7822