共 55 条
- [1] Abu-El-Haija S., 2016, arXiv
- [2] ViViT: A Video Vision Transformer [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6816 - 6826
- [3] Bertasius G, 2021, PR MACH LEARN RES, V139
- [4] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4724 - 4733
- [6] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
- [7] Dosovitskiy A, 2021, INT C LEARN REPR
- [8] Games E., Unreal Engine Homepage
- [9] Vision meets robotics: The KITTI dataset [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) : 1231 - 1237
- [10] Goceri Evgin, 2020, 2020 IEEE 4th International Conference on Image Processing, Applications and Systems (IPAS), P144, DOI 10.1109/IPAS50080.2020.9334937