86 entries in total
[1]
Counterfactual Vision and Language Learning
[J].
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020),
2020,
:10041-10051
[2]
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:4971-4980
[3]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6077-6086
[4]
SPICE: Semantic Propositional Image Caption Evaluation
[J].
COMPUTER VISION - ECCV 2016, PT V,
2016, 9909
:382-398
[5]
[Anonymous], 2000, Causality: models, reasoning and inference
[6]
[Anonymous], 2009, International Conference on Machine Learning, DOI 10.1145/1553374.1553463
[7]
VQA: Visual Question Answering
[J].
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2015,
:2425-2433
[8]
Atwood J, 2016, ADV NEUR IN, V29
[10]
Banerjee S., 2005, P ACL WORKSHOP INTRI, DOI 10.3115/1626355.1626389