共 53 条
[1]
Neighbor-view Enhanced Model for Vision and Language Navigation
[J].
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021,
2021,
:5101-5109
[2]
Anderson P, 2018, Arxiv, DOI arXiv:1807.06757
[3]
Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:3674-3683
[4]
Matterport3D: Learning from RGB-D Data in Indoor Environments
[J].
PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV),
2017,
:667-676
[5]
Semantic Curiosity for Active Visual Learning
[J].
COMPUTER VISION - ECCV 2020, PT VI,
2020, 12351
:309-326
[7]
Reinforced Structured State-Evolution for Vision-Language Navigation
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022),
2022,
:15429-15438
[8]
Chen SZ, 2021, ADV NEUR IN, V34
[9]
Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022),
2022,
:16516-16526
[10]
Deitke M, 2022, Arxiv, DOI [arXiv:2210.06849, DOI 10.48550/ARXIV.2210.06849]