50 items in total
- [21] Faithful Multimodal Explanation for Visual Question Answering. BLACKBOXNLP WORKSHOP ON ANALYZING AND INTERPRETING NEURAL NETWORKS FOR NLP AT ACL 2019, 2019: 103-112
- [22] Multi-Modal Fusion Transformer for Visual Question Answering in Remote Sensing. IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING XXVIII, 2022, 12267
- [23] Learning to Specialize with Knowledge Distillation for Visual Question Answering. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [25] MUTAN: Multimodal Tucker Fusion for Visual Question Answering. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017: 2631-2639
- [26] Multimodal Knowledge Reasoning for Enhanced Visual Question Answering. 2022 16TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS, SITIS, 2022: 224-230
- [27] MUREL: Multimodal Relational Reasoning for Visual Question Answering. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 1989-1998
- [28] Multimodal Prompt Retrieval for Generative Visual Question Answering. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023: 2518-2535
- [29] Segmentation-Guided Attention for Visual Question Answering from Remote Sensing Images. IGARSS 2024-2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, IGARSS 2024, 2024: 2750-2754
- [30] Overcoming Language Bias in Remote Sensing Visual Question Answering via Adversarial Training. IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023: 2235-2238