共 88 条
[11]
Fei H, 2023, PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, P5980
[12]
Fei-Fei L., 2004, COMPUT VIS IMAGE UND, P178, DOI DOI 10.1016/J.CVIU.2005.09.012
[14]
Guo ZY, 2023, Arxiv, DOI arXiv:2309.00615
[15]
MVTN: Multi-View Transformation Network for 3D Shape Recognition
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:1-11
[16]
Li LH, 2019, Arxiv, DOI [arXiv:1908.03557, 10.48550/arXiv.1908.03557, DOI 10.48550/ARXIV.1908.03557]
[17]
CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition
[J].
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW,
2023,
:2020-2030
[18]
Houlsby N, 2019, PR MACH LEARN RES, V97
[19]
Prompting Visual-Language Models for Efficient Video Understanding
[J].
COMPUTER VISION - ECCV 2022, PT XXXV,
2022, 13695
:105-124
[20]
Lee S ..., 2022, P ADV NEUR INF PROC, P23580