共 50 条
- [32] Learning Visual Question Answering by Bootstrapping Hard Attention COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 : 3 - 20
- [33] VISUAL QUESTION ANSWERING IN REMOTE SENSING WITH CROSS-ATTENTION AND MULTIMODAL INFORMATION BOTTLENECK IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6278 - 6281
- [34] Multimodal Dual Attention Memory for Video Story Question Answering COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 698 - 713
- [35] Survey on Visual Question Answering Ruan Jian Xue Bao/Journal of Software, 2021, 32 (08): : 2522 - 2544
- [37] Multimodal Local Perception Bilinear Pooling for Visual Question Answering IEEE ACCESS, 2018, 6 : 57923 - 57932
- [39] Improving Visual Question Answering by Multimodal Gate Fusion Network 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,