共 50 条
- [1] Uni-paint: A Unified Framework for Multimodal Image Inpainting with Pretrained Diffusion Model PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3190 - 3199
- [2] Prompt Tuning for Unified Multimodal Pretrained Models FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 402 - 416
- [3] Visual Commonsense in Pretrained Unimodal and Multimodal Models NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5321 - 5335
- [4] Point-Cloud Completion with Pretrained Text-to-image Diffusion Models ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [5] Transferring General Multimodal Pretrained Models to Text Recognition FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 588 - 597
- [8] Multimodal Data Augmentation for Image Captioning using Diffusion Models PROCEEDINGS OF THE 1ST WORKSHOP ON LARGE GENERATIVE MODELS MEET MULTIMODAL APPLICATIONS, LGM3A 2023, 2023, : 23 - 33