50 entries in total
- [22] MUTAN: Multimodal Tucker Fusion for Visual Question Answering. 2017 IEEE International Conference on Computer Vision (ICCV), 2017: 2631-2639
- [23] Multimodal Knowledge Reasoning for Enhanced Visual Question Answering. 2022 16th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS), 2022: 224-230
- [24] MUREL: Multimodal Relational Reasoning for Visual Question Answering. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2019), 2019: 1989-1998
- [25] Health-Oriented Multimodal Food Question Answering. MultiMedia Modeling, MMM 2023, Pt I, 2023, 13833: 191-203
- [26] Unifying Text, Tables, and Images for Multimodal Question Answering. Findings of the Association for Computational Linguistics (EMNLP 2023), 2023: 9355-9367
- [27] Multimodal Prompt Retrieval for Generative Visual Question Answering. Findings of the Association for Computational Linguistics, ACL 2023, 2023: 2518-2535
- [28] Dealing with spoken requests in a multimodal Question Answering system. Artificial Intelligence: Methodology, Systems, and Applications, 2008, 5253: 93-102
- [29] QAlayout: Question Answering Layout Based on Multimodal Attention for Visual Question Answering on Corporate Document. Document Analysis Systems, DAS 2022, 2022, 13237: 659-673
- [30] VQA-GNN: Reasoning with Multimodal Knowledge via Graph Neural Networks for Visual Question Answering. 2023 IEEE/CVF International Conference on Computer Vision (ICCV 2023), 2023: 21525-21535