共 27 条
- [21] Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5038 - 5047
- [22] ClusterE-ZSL: A Novel Cluster-Based Embedding for Enhanced Zero-Shot Learning in Contrastive Pre-Training Cross-Modal Retrieval IEEE ACCESS, 2024, 12 : 162622 - 162637
- [23] Retrieval-based Knowledge Augmented Vision Language Pre-training PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5399 - 5409
- [24] A3R: Vision Language Pre-training by Attentive Alignment and Attentive Reconstruction PATTERN RECOGNITION AND COMPUTER VISION, PT V, PRCV 2024, 2025, 15035 : 129 - 142
- [25] ZEN-IQA: Zero-Shot Explainable and No-Reference Image Quality Assessment With Vision Language Model IEEE ACCESS, 2024, 12 : 70973 - 70983