共 30 条
[1]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6077-6086
[2]
[Anonymous], 2015, Natrue, DOI DOI 10.1038/NATURE14539
[3]
BO D, 2021, ARXIV VOL ABS 2101 0
[4]
IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval
[J].
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020),
2020,
:12652-12660
[5]
Faghri F., 2018, P BRIT MACH VIS C BM
[7]
HINTON GE, 2015, ARXIV VOL ABS 1503 0
[8]
HOBBS JR, 1994, NATURAL LANGUAGE PRO
[10]
Learning Semantic Concepts and Order for Image and Sentence Matching
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6163-6171