共 50 条
- [41] Modal Interaction-Enhanced Prompt Learning by Transformer Decoder for Vision-Language Models KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT IV, KSEM 2023, 2023, 14120 : 163 - 174
- [42] Fine-Grained Visual Prompt Learning of Vision-Language Models for Image Recognition PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5828 - 5836
- [43] Modal interaction-enhanced prompt learning by transformer decoder for vision-language models International Journal of Multimedia Information Retrieval, 2023, 12
- [46] Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 2551 - 2562
- [47] Task-Oriented Multi-Modal Mutual Learning for Vision-Language Models 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21902 - 21912
- [48] LifeGraph 4-Lifelog Retrieval using Multimodal Knowledge Graphs and Vision-Language Models PROCEEDINGS OF 2024 ACM WORKSHOP ON THE LIFELOG SEARCH CHALLENGE, LSC 2024, 2024, : 88 - 92
- [49] Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-trained Vision-Language Models 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 11629 - 11639
- [50] The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models? COMPUTER VISION - ECCV 2024, PT XLVIII, 2025, 15106 : 127 - 142