共 36 条
[11]
Huang PY, 2021, Arxiv, DOI arXiv:2103.08849
[12]
Huang Zhenyu, 2021, Advances in Neural Information Processing Systems, V34
[13]
Jain A, 2021, Arxiv, DOI arXiv:2109.05125
[14]
Partially Relevant Video Retrieval
[J].
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022,
2022,
[15]
Kim Jae Myung, 2023, P IEEE CVF C COMP VI, P2584
[17]
SViTT: Temporal Learning of Sparse Video-Text Transformers
[J].
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2023,
:18919-18929
[20]
M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-training
[J].
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021,
2021,
:3976-3985