50 entries in total
- [41] FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024: 1330-1350
- [42] Dual-Key Multimodal Backdoors for Visual Question Answering. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022: 15354-15364
- [43] Multimodal Graph Networks for Compositional Generalization in Visual Question Answering. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [44] Contrastive training of a multimodal encoder for medical visual question answering. INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 18
- [47] Visual Question Answering based on multimodal triplet knowledge accumulation. 2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022: 81-84
- [50] Improving Visual Question Answering by Multimodal Gate Fusion Network. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023