共 28 条
- [1] Towards Adversarial Attack on Vision-Language Pre-training Models PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5005 - 5013
- [2] Transferable Multimodal Attack on Vision-Language Pre-training Models 45TH IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP 2024, 2024, : 1722 - 1740
- [3] LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [4] LiFT: Transfer Learning in Vision-Language Models for Downstream Adaptation and Generalization PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4678 - 4687
- [5] HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23507 - 23517
- [7] Contrastive Region Guidance: Improving Grounding in Vision-Language Models Without Training COMPUTER VISION - ECCV 2024, PT LXXIX, 2025, 15137 : 198 - 215
- [8] Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 2443 - 2459
- [9] GIVL: Improving Geographical Inclusivity of Vision-Language Models with Pre-Training Methods 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10951 - 10961