共 36 条
[1]
Sequence-to-Sequence Contrastive Learning for Text Recognition
[J].
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021,
2021,
:15297-15307
[2]
[Anonymous], 2006, P IEEE COMP SOC C CO, P1735
[3]
Bellver M, 2020, Arxiv, DOI arXiv:2010.00263
[4]
Botach A., 2021, arXiv
[5]
Distilling Audio-Visual Knowledge by Compositional Contrastive Learning
[J].
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021,
2021,
:7012-7021
[6]
Vision-Language Transformer and Query Generation for Referring Segmentation
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:16301-16310
[7]
Ding Zihan, 2021, 3 LARG SCAL VID OBJ, P7
[8]
Dozat T, 2017, Arxiv, DOI [arXiv:1611.01734, 10.48550/arXiv.1611.01734, DOI 10.48550/ARXIV.1611.01734]
[9]
Actor and Action Video Segmentation from a Sentence
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:5958-5966
[10]
Referring Image Segmentation via Cross-Modal Progressive Comprehension
[J].
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020),
2020,
:10485-10494