共 50 条
- [1] MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 : 691 - 708
- [6] MimCo: Masked Image Modeling Pre-training with Contrastive Teacher PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4487 - 4495
- [10] JM-CLIP: A JOINT MODAL SIMILARITY CONTRASTIVE LEARNING MODEL FOR VIDEO-TEXT RETRIEVAL 2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3010 - 3014