63 entries in total
[2]
Akata Z, 2015, PROC CVPR IEEE, P2927, DOI 10.1109/CVPR.2015.7298911
[3]
ViViT: A Video Vision Transformer [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 6816-6826
[4]
Ba J. L., 2016, arXiv, DOI 10.48550/arXiv.1607.06450
[5]
Bertasius G, 2021, PR MACH LEARN RES, V139
[6]
Rethinking Zero-shot Video Classification: End-to-end Training for Realistic Applications [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020: 4612-4622
[7]
Carreira J., 2018, arXiv
[8]
Elaborative Rehearsal for Zero-shot Action Recognition [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 13618-13627
[9]
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[10]
Dosovitskiy A., 2021, P 9 INT C LEARN REPR