共 69 条
- [1] Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding for Video Captioning [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12479 - 12488
- [2] Abu-El-Haija Sami, 2016, Youtube-8m: A large-scale video classification benchmark
- [3] Threat of Adversarial Attacks on Deep Learning in Computer Vision: A Survey [J]. IEEE ACCESS, 2018, 6 : 14410 - 14430
- [4] [Anonymous], 2019, ICML
- [5] Ardulov Victor, 2021, SCI REPORTS, V11, P1
- [7] Bertasius G, 2021, PR MACH LEARN RES, V139
- [8] Understanding Robustness of Transformers for Image Classification [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10211 - 10221
- [9] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4724 - 4733
- [10] Carreira Joao, 2018, QUO VADIS ACTION REC