共 50 条
- [11] Cross-Modal Dense Passage Retrieval for Outside Knowledge Visual Question Answering 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2829 - 2834
- [12] Structured Attentions for Visual Question Answering 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1300 - 1309
- [15] Gated Multi-modal Fusion with Cross-modal Contrastive Learning for Video Question Answering ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII, 2023, 14260 : 427 - 438
- [16] Deep Semantic Correlation with Adversarial Learning for Cross-Modal Retrieval PROCEEDINGS OF 2019 IEEE 9TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC 2019), 2019, : 252 - 255
- [18] Cross-Modal Multistep Fusion Network With Co-Attention for Visual Question Answering IEEE ACCESS, 2018, 6 : 31516 - 31524
- [19] Cross-Modal Feature Distribution Calibration for Few-Shot Visual Question Answering THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7151 - 7159