共 62 条
- [1] Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6077 - 6086
- [2] [Anonymous], 2016, PROC C EMPIRICAL MET
- [3] [Anonymous], 2012, Proceedings of the 21st International Conference on World Wide Web
- [4] VQA: Visual Question Answering [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2425 - 2433
- [5] Character Region Awareness for Text Detection [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9357 - 9366
- [6] MUTAN: Multimodal Tucker Fusion for Visual Question Answering [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2631 - 2639
- [7] Biten Ali Furkan, 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR). Proceedings, P1563, DOI 10.1109/ICDAR.2019.00251
- [8] Scene Text Visual Question Answering [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4290 - 4300
- [10] Reading Wikipedia to Answer Open-Domain Questions [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1870 - 1879