共 28 条
- [1] Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4971 - 4980
- [2] Al-Ajlan A., 2015, INT J MACHINE LEARNI
- [3] Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3674 - 3683
- [4] [Anonymous], 2018, NIPS
- [5] VQA: Visual Question Answering [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2425 - 2433
- [6] MUTAN: Multimodal Tucker Fusion for Visual Question Answering [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2631 - 2639
- [7] Cadene Remi., 2019, NIPS
- [8] Chen L, 2020, PROC CVPR IEEE, P10797, DOI 10.1109/CVPR42600.2020.01081
- [9] Cheng Z., 2021, IJCAI
- [10] Clark Christopher, 2019, EMNLP