17 items in total
- [4] Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023: 5120 - 5131
- [5] Efficient Medical Images Text Detection with Vision-Language Pre-training Approach ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
- [6] IDEA: Increasing Text Diversity via Online Multi-Label Recognition for Vision-Language Pre-training PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022: 4573 - 4583
- [7] Omniview-Tuning: Boosting Viewpoint Invariance of Vision-Language Pre-training Models COMPUTER VISION - ECCV 2024, PT XXVI, 2025, 15084 : 309 - 327
- [10] Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 284 - 302