48 entries in total
[1]
Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs. 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2024: 1818-1826
[2]
Chen J, 2024, Arxiv, DOI arXiv:2402.03216
[3]
MISS: A Generative Pre-training and Fine-Tuning Approach for Med-VQA. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT VIII, 2024, 15023: 299-313
[4]
Chen JY, 2024, 2024 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2024, P7346
[5]
Chen T, 2020, PR MACH LEARN RES, V119
[6]
Chen Y, 2023, 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), P14948
[7]
Dettmers Tim, 2023, Advances in Neural Information Processing Systems
[8]
Eslami S, 2023, 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, P1181
[9]
FLEISS JL, 1971, PSYCHOL BULL, V76, P378, DOI 10.1037/h0031619
[10]
Gui LK, 2022, NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, P956