共 50 条
- [21] VISUAL QUESTION ANSWERING IN REMOTE SENSING WITH CROSS-ATTENTION AND MULTIMODAL INFORMATION BOTTLENECK IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6278 - 6281
- [22] Bi-Modal Transformer-Based Approach for Visual Question Answering in Remote Sensing Imagery IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
- [23] Deep Cross-Modal ImageVoice Retrieval in Remote Sensing IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (10): : 7049 - 7061
- [24] Embedding Spatial Relations in Visual Question Answering for Remote Sensing 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 310 - 316
- [25] Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 1097 - 1103
- [26] Gated Multi-modal Fusion with Cross-modal Contrastive Learning for Video Question Answering ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII, 2023, 14260 : 427 - 438
- [27] Enhancing Visual Question Answering with Prompt-based Learning: A Cross-modal Approach for Deep Semantic Understanding PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ALGORITHMS, SOFTWARE ENGINEERING, AND NETWORK SECURITY, ASENS 2024, 2024, : 713 - 717
- [29] VCD: Visual Causality Discovery for Cross-Modal Question Reasoning PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VII, 2024, 14431 : 309 - 322