共 35 条
[1]
Ba JL, 2016, arXiv
[2]
End-to-End Object Detection with Transformers
[J].
COMPUTER VISION - ECCV 2020, PT I,
2020, 12346
:213-229
[3]
Chen SZ, 2021, ADV NEUR IN, V34
[4]
Cheng ZS, 2023, Arxiv, DOI arXiv:2303.07216
[5]
Devlin J, 2019, Arxiv, DOI arXiv:1810.04805
[6]
Vision-Language Transformer and Query Generation for Referring Segmentation
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:16301-16310
[7]
Ding Henghui, 2022, IEEE Trans. Pattern Anal. Mach. Intell.
[8]
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[9]
Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation
[J].
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021,
2021,
:15501-15510
[10]
Segmentation from Natural Language Expressions
[J].
COMPUTER VISION - ECCV 2016, PT I,
2016, 9905
:108-124