65 entries in total
[21] Stacked Cross Attention for Image-Text Matching [J]. COMPUTER VISION - ECCV 2018, PT IV, 2018, 11208: 212-228
[23] Visual Semantic Reasoning for Image-Text Matching [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019: 4653-4661
[24] Identity-Aware Textual-Visual Matching with Latent Co-attention [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017: 1908-1917
[25] Visual-Semantic Matching by Exploring High-Order Attention and Distraction [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020: 12783-12792
[26] Microsoft COCO: Common Objects in Context [J]. COMPUTER VISION - ECCV 2014, PT V, 2014, 8693: 740-755
[27] Liu C., 2020, P IEEECVF C COMPUTER, P10921
[28] Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019: 3-11
[30] Social Relation Recognition from Videos via Multi-scale Spatial-Temporal Reasoning [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 3561-3569