共 50 条
- [2] Cross-Modal Multistep Fusion Network With Co-Attention for Visual Question Answering IEEE ACCESS, 2018, 6 : 31516 - 31524
- [3] HUMAN GUIDED CROSS-MODAL REASONING WITH SEMANTIC ATTENTION LEARNING FOR VISUAL QUESTION ANSWERING 2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 2775 - 2779
- [4] Cross-Modal Visual Question Answering for Remote Sensing Data 2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021, : 57 - 65
- [5] Cross-modal Relational Reasoning Network for Visual Question Answering 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3939 - 3948
- [7] Cross-Modal Retrieval for Knowledge-Based Visual Question Answering ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT I, 2024, 14608 : 421 - 438
- [10] Cross-Modal Dense Passage Retrieval for Outside Knowledge Visual Question Answering 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2829 - 2834