共 50 条
- [41] Text-video retrieval method based on enhanced self-attention and multi-task learning Multimedia Tools and Applications, 2023, 82 : 24387 - 24406
- [44] Utilizing Text-Video Relationships: A Text-Driven Multi-modal Fusion Framework for Moment Retrieval and Highlight Detection PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 254 - 268
- [46] CelebV-Text: A Large-Scale Facial Text-Video Dataset 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 14805 - 14814
- [48] Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4626 - 4636
- [49] SPSD: Similarity-preserving self-distillation for video–text retrieval International Journal of Multimedia Information Retrieval, 2023, 12