共 81 条
[1]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6077-6086
[2]
[Anonymous], 2017, CVPR, DOI DOI 10.1109/CVPR.2017.470
[3]
[Anonymous], 2018, CVPR, DOI DOI 10.1109/CVPR.2018.00522
[4]
[Anonymous], 33 AAAI C ART INT
[5]
[Anonymous], 2018, COMPUTER VISION PATT, DOI DOI 10.1109/CVPR.2018.00447
[6]
[Anonymous], 2020, ACM INT C MULT, DOI DOI 10.1109/IPEMC-ECCEASIA48364.2020.9367932
[7]
VQA: Visual Question Answering
[J].
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2015,
:2425-2433
[8]
Bengio Emmanuel, 2015, Conditional computation in neural networks for faster models
[9]
Bolukbasi T., 2017, PR MACH LEARN RES, P527
[10]
Query-guided Regression Network with Context Policy for Phrase Grounding
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:824-832