共 68 条
[1]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6077-6086
[2]
[Anonymous], 2018, IEEE T NEUR NET LEAR, DOI DOI 10.1109/TNNLS.2018.2817340
[3]
[Anonymous], 2010, P 18 ACM INT C MULT, DOI 10.1145/1873951.1873987
[4]
[Anonymous], 2016, P INT C LEARN REPR
[5]
[Anonymous], 2015, ARXIV151202167V2
[6]
[Anonymous], 2017, INT C LEARNING REPRE
[7]
VQA: Visual Question Answering
[J].
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2015,
:2425-2433
[8]
Bahdanau D., 2015, P INT C MACH LEARN I
[9]
Bai S., 2018, An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling
[10]
Visual Question Reasoning on General Dependency Tree
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:7249-7257