共 25 条
- [1] Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6077 - 6086
- [2] VQA: Visual Question Answering [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2425 - 2433
- [3] Bordes A., 2013, ADV NEURAL INFORM PR, V2013, P2787, DOI DOI 10.5555/2999792.2999923
- [4] Chen Z., 2021, Zero-shot visual question answering using knowledge graph
- [5] Fukui A., 2016, ARXIV160601847
- [6] Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6325 - 6334
- [7] Deep Residual Learning for Image Recognition [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
- [8] Hochreiter S., 1995, Long short term memory
- [9] Answer-Type Prediction for Visual Question Answering [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4976 - 4984
- [10] Kim JH, 2018, ADV NEUR IN, V31