55 entries in total
- [1] Abnar S, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P4190
- [2] Akbari Hassan, 2021, ARXIV210411178
- [3] Andoni A, 2015, ADV NEUR IN, V28
- [4] [Anonymous], 2010, P 18 ACM INT C MULT
- [5] [Anonymous], 2020, P IEEE CVF C COMP VI
- [6] ViViT: A Video Vision Transformer. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 6816-6826
- [7] Bertasius Gedas, 2021, arXiv
- [8] Carion N., 2020, EUR C COMP VIS SPRIN, P213
- [9] Carreira J., 2018, arXiv
- [10] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017: 4724-4733