共 35 条
[2]
Linking Image and Text with 2-Way Nets
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:1855-1865
[3]
Faghri Fartash, 2018, BRIT MACH VIS C
[4]
Stacked Latent Attention for Multimodal Reasoning
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:1072-1080
[5]
Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:7181-7189
[6]
Deep Residual Learning for Image Recognition
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:770-778
[7]
Herzig R, 2018, ADV NEUR IN, V31
[9]
Learning Semantic Concepts and Order for Image and Sentence Matching
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6163-6171
[10]
Instance-aware Image and Sentence Matching with Selective Multimodal LSTM
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:7254-7262