共 62 条
[1]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6077-6086
[2]
[Anonymous], 2013, 27 INT C NEUR INF PR
[3]
Global Relation-Aware Attention Network for Image-Text Retrieval
[J].
PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21),
2021,
:19-28
[4]
IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval
[J].
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020),
2020,
:12652-12660
[5]
Learning the Best Pooling Strategy for Visual Semantic Embedding
[J].
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021,
2021,
:15784-15793
[6]
Chen TL, 2020, AAAI CONF ARTIF INTE, V34, P10583
[9]
Diao HW, 2021, AAAI CONF ARTIF INTE, V35, P1218
[10]
Faghri F., 2018, BMVC, P1