共 45 条
[1]
IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval
[J].
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020),
2020,
:12652-12660
[2]
Devlin J, 2019, Arxiv, DOI [arXiv:1810.04805, 10.48550/arxiv.1810.04805]
[3]
Linking Image and Text with 2-Way Nets
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:1855-1865
[4]
Faghri F, 2018, Arxiv, DOI arXiv:1707.05612
[5]
Cross-modal Retrieval with Correspondence Autoencoder
[J].
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14),
2014,
:7-16
[6]
Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval
[J].
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021,
2021,
:5185-5193
[7]
Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:7181-7189
[9]
Deep Residual Learning for Image Recognition
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:770-778
[10]
Hotelling H, 1936, BIOMETRIKA, V28, P321, DOI 10.2307/2333955