共 90 条
[1]
ViViT: A Video Vision Transformer
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:6816-6826
[2]
Ba J, 2014, ACS SYM SER
[3]
BACK T, 1996, EVOLUTIONARY ALGORIT
[4]
Cao CQ, 2022, Arxiv, DOI [arXiv:2202.06503, 10.1109/LSP.2022.3226411]
[5]
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:4724-4733
[6]
Video Moment Retrieval from Text Queries via Single Frame Annotation
[J].
PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22),
2022,
:1033-1043
[9]
Adaptive Token Sampling for Efficient Vision Transformers
[J].
COMPUTER VISION, ECCV 2022, PT XI,
2022, 13671
:396-414
[10]
MIST: Multiple Instance Self-Training Framework for Video Anomaly Detection
[J].
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021,
2021,
:14004-14013