共 68 条
- [1] Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6077 - 6086
- [2] [Anonymous], 2018, IEEE T NEUR NET LEAR, DOI DOI 10.1109/TNNLS.2018.2817340
- [3] [Anonymous], 2016, P INT C LEARN REPR
- [4] [Anonymous], 2015, P 3 INT C LEARN REPR
- [5] [Anonymous], 2017, INT C LEARNING REPRE
- [6] VQA: Visual Question Answering [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2425 - 2433
- [7] Bahdanau D., 2014, 3 INT C LEARN REPR
- [8] Bai S., 2018, ARXIV
- [9] Visual Question Reasoning on General Dependency Tree [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7249 - 7257
- [10] Learning Aligned Cross-Modal Representations from Weakly Aligned Data [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2940 - 2949