12 entries in total
- [1] SkyScapes - Fine-Grained Semantic Understanding of Aerial Scenes [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7392 - 7402
- [2] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
- [3] Segmentation from Natural Language Expressions [J]. COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 108 - 124
- [4] Bi-directional Relationship Inferring Network for Referring Image Segmentation [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4423 - 4432
- [5] Loshchilov I., 2018, INT C LEARN REPR, DOI 10.48550/ARXIV.1711.05101
- [6] Sumbul G., 2020, IEEE T GEOSCIENCE RE
- [7] Xiong Zhitong, 2022, ARXIV
- [8] Xiong Zhitong, 2024, ARXIV
- [9] LAVT: Language-Aware Vision Transformer for Referring Image Segmentation [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18134 - 18144
- [10] Cross-Modal Self-Attention Network for Referring Image Segmentation [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10494 - 10503