11 items in total
- [2] Dense Contrastive Visual-Linguistic Pretraining. Proceedings of the 29th ACM International Conference on Multimedia (MM 2021), 2021, pp. 5203-5212
- [4] Learning Better Visual Dialog Agents with Pretrained Visual-Linguistic Representation. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021), 2021, pp. 5618-5627
- [5] Weakly-Supervised Grounding for VQA with Dual Visual-Linguistic Interaction. Artificial Intelligence, CICAI 2023, Part I, 2024, vol. 14473, pp. 156-169
- [6] Triangle-Reward Reinforcement Learning: Visual-Linguistic Semantic Alignment for Image Captioning. Proceedings of the 29th ACM International Conference on Multimedia (MM 2021), 2021, pp. 4510-4518
- [10] ViStruct: Visual Structural Knowledge Extraction via Curriculum Guided Code-Vision Representation. 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), 2023, pp. 13342-13357