共 39 条
[1]
Agarwal S, 2018, P 11 INT C NAT LANG, ppp129
[2]
VQA: Visual Question Answering
[J].
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2015,
:2425-2433
[3]
Learning visual similarity for product design with convolutional neural networks
[J].
ACM TRANSACTIONS ON GRAPHICS,
2015, 34 (04)
[4]
Bojanowski P., 2016, VALENCIA SPAIN ACL, V2, P427
[5]
Bojanowski P, 2017, Transactions of the Association for Computational Linguistics, V5, P135, DOI [DOI 10.1162/TACL_A_00051, 10.1162/tacla00051]
[6]
Chauhan H, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P5437
[8]
Visual Dialog
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:1080-1089
[9]
GuessWhat?! Visual object discovery through multi-modal dialogue
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:4466-4475
[10]
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171