共 49 条
[31]
Santoro A, 2017, ADV NEUR IN, V30
[32]
Where To Look: Focus Regions for Visual Question Answering
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:4613-4621
[33]
Tai KS, 2015, PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, P1556
[34]
Tips and Tricks for Visual Question Answering: Learnings from the 2017 Challenge
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:4223-4232
[35]
Graph-Structured Representations for Visual Question Answering
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:3233-3241
[37]
Xiong CM, 2016, PR MACH LEARN RES, V48
[38]
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering
[J].
COMPUTER VISION - ECCV 2016, PT VII,
2016, 9911
:451-466
[39]
Yang D, 2026, P 2016 C EMP METH NA, P457, DOI [10.18653/v1/d16-1044, DOI 10.18653/V1/D16-1044, 10.18653/v1/D16-1044]
[40]
Stacked Attention Networks for Image Question Answering
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:21-29