42 entries in total
[1]
Alayrac JB, 2022, ADV NEUR IN
[2]
VQA: Visual Question Answering [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015: 2425-2433
[3]
Brown TB, 2020, Arxiv, DOI [arXiv:2005.14165, DOI 10.48550/ARXIV.2005.14165]
[4]
Banerjee P, 2021, Arxiv, DOI arXiv:2012.02356
[5]
Black S., 2021, Softw., Metadata, V58
[6]
Bommasani R., 2021, arXiv
[7]
Changpinyo S, 2022, Arxiv, DOI arXiv:2205.01883
[8]
Prompt-RSVQA: Prompting visual context to a language model for Remote Sensing Visual Question Answering [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022: 1371-1380
[9]
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[10]
Du YF, 2023, Arxiv, DOI [arXiv:2305.17006, 10.48550/arXiv.2305.17006]