37 entries in total
[1]
Akula AR, 2021, 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), P2148
[2]
Alayrac JB, 2022, ADV NEUR IN
[3]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 6077-6086
[4]
VQA: Visual Question Answering [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015: 2425-2433
[6]
Firat M., 2023, Journal of Applied Learning and Teaching, V6, DOI 10.37074/JALT.2023.6.1.22
[8]
From Images to Textual Prompts: Zero-shot Visual Question Answering with Frozen Large Language Models [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023: 10867-10877