LLM as Copilot for Coarse-Grained Vision-and-Language Navigation

Cited by: 0
Authors
Qiao, Yanyuan [1]
Liu, Qianyi [2,3]
Liu, Jiajun [4,5]
Liu, Jing [2,3]
Wu, Qi [1]
Affiliations
[1] Univ Adelaide, Australian Inst Machine Learning, Adelaide, SA, Australia
[2] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
[4] CSIRO Data61, Eveleigh, Australia
[5] Univ Queensland, Brisbane, Qld, Australia
Source
COMPUTER VISION - ECCV 2024, PT V | 2025 / Vol. 15063
Keywords
Vision-and-Language Navigation; Large Language Models
DOI
10.1007/978-3-031-72652-1_27
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Vision-and-Language Navigation (VLN) involves guiding an agent through indoor environments using human-provided textual instructions. Coarse-grained VLN, with short and high-level instructions, has gained popularity as it closely mirrors real-world scenarios. However, a significant challenge is that these instructions are often too concise for agents to comprehend and act upon. Previous studies have explored allowing agents to seek assistance during navigation, but they typically offer rigid support drawn from pre-existing datasets or simulators. The advent of Large Language Models (LLMs) presents a novel avenue for aiding VLN agents. This paper introduces VLN-Copilot, a framework enabling agents to actively seek assistance when encountering confusion, with the LLM serving as a copilot to facilitate navigation. Our approach introduces a confusion score that quantifies the level of uncertainty in an agent's action decisions, while the LLM offers real-time detailed guidance for navigation. Experimental results on two coarse-grained VLN datasets show the efficacy of our method.
Pages: 459 - 476
Page count: 18