共 65 条
[51]
Contextual Similarity Distillation for Asymmetric Image Retrieval
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2022,
:9479-9488
[52]
HANet: Hierarchical Alignment Networks for Video-Text Retrieval
[J].
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021,
2021,
:3518-3527
[53]
Xiang Wangmeng, 2022, ARXIV220805318
[54]
Visual Relation Grounding in Videos
[J].
COMPUTER VISION - ECCV 2020, PT VI,
2020, 12351
:447-464
[55]
Xu Mengde, 2021, arXiv preprint arXiv:2112.14757
[56]
Yan Shuanglin, 2022, ARXIV221010276
[60]
Tree-Augmented Cross-Modal Encoding for Complex-Query Video Retrieval
[J].
PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20),
2020,
:1339-1348