共 118 条
[1]
Abu-El-Haija S., 2016, arXiv
[3]
[Anonymous], 2013, arXiv
[4]
[Anonymous], 1989, ADV NEURAL INFORM PR
[6]
ViViT: A Video Vision Transformer
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:6816-6826
[9]
Beltagy Iz, 2020, ARXIV
[10]
Carion Nicolas, 2020, EUROPEAN C COMPUTER