共 66 条
- [1] Andreas J., 2016, NAACL, P1545
- [2] Neural Module Networks [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 39 - 48
- [3] [Anonymous], 2015, Deep captioning with multimodal recurrent neural networks (mRNN)
- [4] [Anonymous], 2015, ADV NEURAL INFORM PR
- [5] [Anonymous], Simple baseline for visual question answering
- [6] [Anonymous], 2015, NEURAL INFORM PROCES
- [7] Mask R-CNN [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2980 - 2988
- [8] Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
- [9] Chelba Ciprian, 2014, One billion word benchmark for measuring progress in statistical language modeling, DOI DOI 10.21437/INTERSPEECH.2014-564
- [10] Chen YP, 2018, ADV NEUR IN, V31