41 items in total
[11] CTR-Driven Advertising Image Generation with Multimodal Large Language Models [J]. PROCEEDINGS OF THE ACM WEB CONFERENCE 2025, WWW 2025, 2025: 2262-2275
[14] FreeMotion: MoCap-Free Human Motion Synthesis with Multimodal Large Language Models [J]. COMPUTER VISION - ECCV 2024, PT XXIII, 2025, 15081: 403-421
[17] FoodMLLM-JP: Leveraging Multimodal Large Language Models for Japanese Recipe Generation [J]. MULTIMEDIA MODELING, MMM 2025, PT I, 2025, 15520: 401-414
[18] Leveraging Multimodal Large Language Models for Enhanced Learning and Application in Building Energy Modeling [J]. MULTIPHYSICS AND MULTISCALE BUILDING PHYSICS, IBPC 2024, VOL 3, 2025, 554: 611-618
[20] Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking Multimodal Large Language Models [J]. COMPUTER VISION - ECCV 2024, PT LXXIII, 2025, 15131: 174-189