共 50 条
- [1] Critical Analysis of Deconfounded Pretraining to Improve Visio-Linguistic Models FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
- [2] DeVLBert: Learning Deconfounded Visio-Linguistic Representations MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4373 - 4382
- [3] DeVLBert: Out-of-distribution Visio-Linguistic Pretraining with Causality 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1744 - 1747
- [4] What You Say Is Not What You Do: Studying Visio-Linguistic Models for TV Series Summarization 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3142 - 3146
- [5] Text encoders bottleneck compositionality in contrastive vision-language models 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 4933 - 4944
- [6] (sic) ECHO: A Visio-Linguistic Dataset for Event Causality Inference via Human-Centric ReasOning FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 4064 - 4085
- [7] Contrasting intra-modal and ranking cross-modal hard negatives to enhance visio-linguistic compositional understanding 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 13774 - 13784
- [8] ML2MG-VLCR: A Multimodal LLM Guided Zero-shot Method for Visio-linguistic Compositional Reasoning with Autoregressive Generative Language Model PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 842 - 850
- [10] Measuring and Narrowing the Compositionality Gap in Language Models FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 5687 - 5711