共 57 条
[22]
Stacked Cross Attention for Image-Text Matching
[J].
COMPUTER VISION - ECCV 2018, PT IV,
2018, 11208
:212-228
[23]
Visual Semantic Reasoning for Image-Text Matching
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:4653-4661
[24]
Relation-Aware Graph Attention Network for Visual Question Answering
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:10312-10321
[25]
Identity-Aware Textual-Visual Matching with Latent Co-attention
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:1908-1917
[26]
Microsoft COCO: Common Objects in Context
[J].
COMPUTER VISION - ECCV 2014, PT V,
2014, 8693
:740-755
[27]
Leveraging Visual Question Answering for Image-Caption Ranking
[J].
COMPUTER VISION - ECCV 2016, PT II,
2016, 9906
:261-277
[28]
Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching
[J].
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19),
2019,
:3-11
[30]
Lu JS, 2019, ADV NEUR IN, V32