50 items in total
- [1] Multimodal Attention for Visual Question Answering INTELLIGENT COMPUTING, VOL 1, 2019, 858 : 783 - 792
- [2] QAlayout: Question Answering Layout Based on Multimodal Attention for Visual Question Answering on Corporate Document DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 659 - 673
- [3] Visual Question Answering based on multimodal triplet knowledge accumuation 2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022 : 81 - 84
- [4] Multimodal Learning and Reasoning for Visual Question Answering ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
- [5] Faithful Multimodal Explanation for Visual Question Answering BLACKBOXNLP WORKSHOP ON ANALYZING AND INTERPRETING NEURAL NETWORKS FOR NLP AT ACL 2019, 2019 : 103 - 112
- [6] IQA: Visual Question Answering in Interactive Environments 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018 : 4089 - 4098
- [7] MUTAN: Multimodal Tucker Fusion for Visual Question Answering 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017 : 2631 - 2639
- [8] Multimodal Knowledge Reasoning for Enhanced Visual Question Answering 2022 16TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS, SITIS, 2022 : 224 - 230
- [9] MUREL: Multimodal Relational Reasoning for Visual Question Answering 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019 : 1989 - 1998
- [10] Multimodal Prompt Retrieval for Generative Visual Question Answering FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023 : 2518 - 2535