共 324 条
[1]
nocaps: novel object captioning at scale
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:8947-8956
[2]
Alayrac JB, 2022, ADV NEUR IN
[5]
Anil R, 2023, Arxiv, DOI [arXiv:2305.10403, DOI 10.48550/ARXIV.2305.10403, 10.48550/arXiv.2305.10403]
[6]
Bach S.H., 2022, Promptsource: An integrated development environment and repository for natural language prompts
[7]
Bai JZ, 2023, Arxiv, DOI arXiv:2308.12966
[8]
Bai Yuntao, 2022, arXiv
[9]
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:1708-1718
[10]
Baumgartner J., 2020, P INT AAAI C WEB SOC, P830, DOI DOI 10.48550/ARXIV.2001.08435