27 items in total
- [4] Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023: 5120-5131
- [5] Efficient Medical Images Text Detection with Vision-Language Pre-training Approach. ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
- [7] Omniview-Tuning: Boosting Viewpoint Invariance of Vision-Language Pre-training Models. COMPUTER VISION - ECCV 2024, PT XXVI, 2025, 15084: 309-327
- [9] Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting. COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688: 284-302