共 28 条
[1]
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:4971-4980
[2]
[Anonymous], P 2019 C N AM CHAPTE, DOI DOI 10.18653/V1/N19-1423
[3]
VQA: Visual Question Answering
[J].
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2015,
:2425-2433
[4]
Grounding Answers for Visual Questions Asked by Visually Impaired People
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022),
2022,
:19076-19085
[5]
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[6]
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:6325-6334
[7]
VizWiz-Priv: A Dataset for Recognizing the Presence and Purpose of Private Visual Information in Images Taken by Blind People
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:939-948
[8]
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:1988-1997
[9]
Weakly Supervised Grounding for VQA in Vision-Language Transformers
[J].
COMPUTER VISION - ECCV 2022, PT XXXV,
2022, 13695
:652-670
[10]
Found a Reason for me? Weakly-supervised Grounded Visual Question Answering using Capsules
[J].
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021,
2021,
:8461-8470