共 65 条
[1]
[Anonymous], 2022, P IEEE CVF C COMP VI
[2]
[Anonymous], 2022, P IEEE CVF C COMP VI, DOI DOI 10.1109/ECTC51906.2022.00250
[3]
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:1708-1718
[4]
Bain Max, 2022, ARXIV220508508
[5]
Bengio S, 2015, ADV NEUR IN, V28
[6]
SimVQA: Exploring Simulated Environments for Visual Question Answering
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022),
2022,
:5046-5056
[7]
Chen GB, 2017, ADV NEUR IN, V30
[8]
Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning
[J].
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020),
2020,
:10635-10644
[9]
TEACHTEXT: CrossModal Generalized Distillation for Text-Video Retrieval
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:11563-11573