50 results in total
- [23] data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
- [25] Siamese Image Modeling for Self-Supervised Vision Representation Learning. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023: 2132-2141
- [26] Jointly Optimal Incremental Learning with Self-Supervised Vision Transformers. 2024 IEEE AEROSPACE CONFERENCE, 2024
- [28] Language Features Matter: Effective Language Representations for Vision-Language Tasks. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019: 7473-7482
- [30] Weakly Supervised Grounding for VQA in Vision-Language Transformers. COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695: 652-670