共 77 条
[2]
Bellver M, 2020, Arxiv, DOI arXiv:2010.00263
[3]
Bruna J., 2013, SPECTRAL NETWORKS LO
[5]
Language-Based Image Editing with Recurrent Attentive Models
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:8721-8729
[6]
Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, 10.48550/arXiv.1706.05587]
[8]
Multi-Modal Dynamic Graph Transformer for Visual Grounding
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022),
2022,
:15513-15522
[10]
Defferrard M, 2016, ADV NEUR IN, V29