共 50 条
- [31] X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4996 - 5005
- [32] Fine-grained Cross-modal Alignment Network for Text-Video Retrieval PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3826 - 3834
- [33] MGSGA: Multi-grained and Semantic-Guided Alignment for Text-Video Retrieval Neural Processing Letters, 56
- [36] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6565 - 6574
- [38] CMFG: Cross-Model Fine-Grained Feature Interaction for Text-Video Retrieval MULTIMEDIA MODELING, MMM 2023, PT II, 2023, 13834 : 435 - 445
- [39] T2VLAD: Global-Local Sequence Alignment for Text-Video Retrieval 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5075 - 5084