共 52 条
[1]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6077-6086
[2]
Ba J, 2014, ACS SYM SER
[3]
Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval
[J].
COMPUTER VISION - ECCV 2020, PT IX,
2020, 12354
:677-694
[4]
Revisiting Approximate Metric Optimization in the Age of Deep Neural Networks
[J].
PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19),
2019,
:1241-1244
[5]
Deep Metric Learning to Rank
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:1861-1870
[6]
Chakrabarti S., 2008, P 14 ACM SIGKDD C KN, P88, DOI [10.1145/140189, 0.1401906, DOI 10.1145/140189]
[7]
IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval
[J].
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020),
2020,
:12652-12660
[8]
Learning the Best Pooling Strategy for Visual Semantic Embedding
[J].
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021,
2021,
:15784-15793
[9]
Adaptive Offline Quintuplet Loss for Image-Text Matching
[J].
COMPUTER VISION - ECCV 2020, PT XIII,
2020, 12358
:549-565
[10]
UNITER: UNiversal Image-TExt Representation Learning
[J].
COMPUTER VISION - ECCV 2020, PT XXX,
2020, 12375
:104-120