44 entries in total
[1]
Agarwal V., P IEEE CVF C COMP VI, P9690
[2]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 6077-6086
[3]
[Anonymous], 2015, Simple baseline for visual question answering
[4]
[Anonymous], 2016, ARXIV160603647
[5]
VQA: Visual Question Answering [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015: 2425-2433
[6]
Blundell C, 2015, PR MACH LEARN RES, V37, P1613
[7]
Cho K., 2014, ARXIV14061078, DOI 10.3115/v1/D14-1179
[8]
Chollet F., KERAS
[10]
Fukui A, 2016, P C EMP METH NAT LAN, P457, DOI 10.18653/v1/D16-1044