共 70 条
[1]
Alayrac JB, 2022, ADV NEUR IN
[2]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6077-6086
[3]
[Anonymous], 2012, CVPR
[4]
Bossard L, 2014, LECT NOTES COMPUT SC, V8694, P446, DOI 10.1007/978-3-319-10599-4_29
[5]
Brown TB, 2020, ADV NEUR IN, V33
[6]
Carion N, 2020, Img Proc Comp Vis Re, V12346, P213, DOI 10.1007/978-3-030-58452-8_13
[7]
Chen J, 2022, ADV NEUR IN
[8]
Chen Yen-Chun, 2020, ECCV
[9]
Describing Textures in the Wild
[J].
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2014,
:3606-3613
[10]
Conneau A., 2020, ACL, P8440, DOI DOI 10.18653/V1/2020.ACL-MAIN.747