共 56 条
[1]
CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP
[J].
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR,
2023,
:7020-7030
[2]
Chua T.-S., 2009, P ACM INT C IM VID R, P1
[3]
Cong Bai, 2020, ICMR '20: Proceedings of the 2020 International Conference on Multimedia Retrieval, P525, DOI 10.1145/3372278.3390711
[5]
Collective Matrix Factorization Hashing for Multimodal Data
[J].
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2014,
:2083-2090
[6]
He Shiyuan, 2022, IEEE Trans. Knowl. data Eng.
[7]
Hinton G E., 2012, arXiv
[8]
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[9]
Creating Something from Nothing: Unsupervised Knowledge Distillation for Cross-Modal Hashing
[J].
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2020,
:3120-3129
[10]
Huiskes M. J., 2008, Proceedings of the 1st ACM international conference on Multimedia information retrieval, P39