共 38 条
[1]
VQA: Visual Question Answering
[J].
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2015,
:2425-2433
[2]
Effective conditioned and composed image retrieval combining CLIP-based features
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022),
2022,
:21434-21442
[3]
Conditioned and composed image retrieval combining and partially fine-tuning CLIP-based features
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022,
2022,
:4955-4964
[4]
Berg TL, 2010, LECT NOTES COMPUT SC, V6311, P663, DOI 10.1007/978-3-642-15549-9_48
[5]
Brown TB, 2020, ADV NEUR IN, V33
[6]
Chen GB, 2017, ADV NEUR IN, V30
[7]
Cohen Niv, 2022, P EUR C COMP VIS ECC
[8]
Cornia M, 2020, PROC CVPR IEEE, P10575, DOI 10.1109/CVPR42600.2020.01059
[9]
Daras Giannis, 2022, NEURIPS 2022 WORKSH
[10]
Delmas Ginger, 2022, P INT C LEARN REPR I