62 entries in total
[41]
Redmon J., 2018, arXiv, DOI 10.48550/ARXIV.1804.02767
[43]
Key-Word-Aware Network for Referring Expression Image Segmentation
[J].
COMPUTER VISION - ECCV 2018, PT VI,
2018, 11210
:38-54
[47]
Vaswani A, 2017, ADV NEUR IN, V30
[48]
Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:6622-6631
[50]
CRIS: CLIP-Driven Referring Image Segmentation
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2022,
:11676-11685