共 50 条
- [41] InteraRec: Interactive Recommendations Using Multimodal Large Language Models TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2024 WORKSHOPS, RAFDA AND IWTA, 2024, 14658 : 32 - 43
- [42] Exploring the Transferability of Visual Prompting for Multimodal Large Language Models 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 26552 - 26562
- [43] Enhancing Urban Walkability Assessment with Multimodal Large Language Models COMPUTATIONAL SCIENCE AND ITS APPLICATIONS-ICCSA 2024 WORKSHOPS, PT V, 2024, 14819 : 394 - 411
- [45] UniCode: Learning a Unified Codebook for Multimodal Large Language Models COMPUTER VISION - ECCV 2024, PT VIII, 2025, 15066 : 426 - 443
- [46] QueryMintAI: Multipurpose Multimodal Large Language Models for Personal Data IEEE ACCESS, 2024, 12 : 144631 - 144651
- [47] BLINK: Multimodal Large Language Models Can See but Not Perceive COMPUTER VISION - ECCV 2024, PT XXIII, 2025, 15081 : 148 - 166
- [48] Multimodal Large Language Models as Built Environment Auditing Tools PROFESSIONAL GEOGRAPHER, 2025, 77 (01): : 84 - 90
- [49] Align is not Enough: Multimodal Universal Jailbreak Attack against Multimodal Large Language Models IEEE Transactions on Circuits and Systems for Video Technology,
- [50] Images in Language Space: Exploring the Suitability of Large Language Models for Vision & Language Tasks FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 14196 - 14210