共 29 条
- [1] Effective conditioned and composed image retrieval combining CLIP-based features [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 21434 - 21442
- [2] Visual Dialog [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1080 - 1089
- [3] Guo XX, 2018, ADV NEUR IN, V31
- [4] Karthik S, 2024, Arxiv, DOI arXiv:2310.09291
- [5] Levy M, 2023, Arxiv, DOI arXiv:2305.20062
- [6] Levy M, 2023, Arxiv, DOI arXiv:2303.09429
- [7] Li JN, 2022, PR MACH LEARN RES
- [8] Li JN, 2023, Arxiv, DOI [arXiv:2301.12597, DOI 10.48550/ARXIV.2301.12597]
- [9] Microsoft COCO: Common Objects in Context [J]. COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 : 740 - 755
- [10] Liu H., 2024, Advances in Neural Information Processing Systems, V36