57 entries in total
[1]
[Anonymous], 2015, INT C COMP VIS ICCV
[2]
[Anonymous], 2011, C NEUR INF PROC SYST
[3]
[Anonymous], 2016, Exploring the limits of language modeling
[4]
VQA: Visual Question Answering [J]. 2015 IEEE International Conference on Computer Vision (ICCV), 2015: 2425-2433
[5]
Bao Hangbo, 2021, BEIT BERT PRE TRAINI
[6]
Bugliarello Emanuele, 2021, T ASS COMPUTATIONAL
[7]
Changpinyo Soravit, 2021, C COMP VIS PATT REC
[8]
UNITER: UNiversal Image-TExt Representation Learning [J]. Computer Vision - ECCV 2020, Pt XXX, 2020, 12375: 104-120
[9]
Cho J, 2021, PR MACH LEARN RES, V139
[10]
Clark K., 2020, P 8 INT C LEARNING R, P1