共 80 条
[1]
Aggarwal P, 2020, Arxiv, DOI arXiv:2012.05107
[2]
[Anonymous], 2005, P MACH TRANSL SUMM 1
[3]
Compositional Learning of Image-Text Query for Image Retrieval
[J].
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021),
2021,
:1139-1148
[5]
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:4724-4733
[6]
Learning Style-Invariant Robust Representation for Generalizable Visual Instance Retrieval
[J].
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023,
2023,
:6171-6180
[7]
Chang XJ, 2015, PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), P2234
[8]
Chen Guanhua, 2023, Long Papers, V1, P13028
[9]
Learning the Best Pooling Strategy for Visual Semantic Embedding
[J].
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021,
2021,
:15784-15793
[10]
Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning
[J].
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020),
2020,
:10635-10644