32 entries in total
[1] Cross-Modal Retrieval in the Cooking Context: Learning Semantic Text-Image Embeddings. ACM/SIGIR Proceedings 2018, 2018: 35-44
[2] MCEN: Bridging Cross-Modal Gap between Cooking Recipes and Dish Images with Latent Variable Model. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2020), 2020: 14558-14568
[4] Deep Reconstruction-Classification Networks for Unsupervised Domain Adaptation. Computer Vision - ECCV 2016, Pt IV, 2016, 9908: 597-613
[5] Deep Residual Learning for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016: 770-778
[6] Hochreiter S., 1997, Neural Computation, V9, P1735
[8] Gulrajani I, 2017, Advances in Neural Information Processing Systems, V30
[9] Stacked Cross Attention for Image-Text Matching. Computer Vision - ECCV 2018, Pt IV, 2018, 11208: 212-228
[10] Lu JS, 2019, Advances in Neural Information Processing Systems, V32