共 50 条
- [2] An Empirical Study of Frame Selection for Text-to-Video Retrieval FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 6821 - 6832
- [3] Holistic Features are almost Sufficient for Text-to-Video Retrieval 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 17138 - 17147
- [4] Factorizing Text-to-Video Generation by Explicit Image Conditioning COMPUTER VISION - ECCV 2024, PT LXII, 2025, 15120 : 205 - 224
- [6] Visual to Text: Survey of Image and Video Captioning IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2019, 3 (04): : 297 - 312
- [7] Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5207 - 5214
- [8] Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval COMPUTER VISION - ECCV 2022, PT XIV, 2022, 13674 : 444 - 461
- [9] Write What YouWant: Applying Text-to-Video Retrieval to Audiovisual Archives ACM JOURNAL ON COMPUTING AND CULTURAL HERITAGE, 2023, 16 (04):
- [10] Fighting FIRe with FIRE: Assessing the Validity of Text-to-Video Retrieval Benchmarks 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 47 - 68