共 42 条
[1]
nocaps: novel object captioning at scale
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:8947-8956
[2]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6077-6086
[3]
[Anonymous], 2019, Neurips
[4]
[Anonymous], ECCV
[5]
Duerig T, 2018, ARXIV181100982
[6]
Faghri Fartash, 2017, arXiv
[7]
Fang H, 2015, PROC CVPR IEEE, P1473, DOI 10.1109/CVPR.2015.7298754
[8]
Gan Zhe, 2020, ADV NEURAL INFORM PR
[9]
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:6325-6334
[10]
Hu Xiaowei, 2020, ARXIV PREPRINT ARXIV