共 47 条
[1]
ViViT: A Video Vision Transformer
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:6816-6826
[2]
Ba J, 2014, ACS SYM SER
[3]
Babaee Khobdeh S., 2021, J APPL RES INDUST EN, V8, P412, DOI DOI 10.22105/JARIE.2021.276107.1270
[5]
Bertasius G, 2021, PR MACH LEARN RES, V139
[6]
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:4724-4733
[7]
de Melo WC, 2019, IEEE INT CONF AUTOMA, P554, DOI 10.1109/fg.2019.8756568
[8]
Donahue J, 2015, PROC CVPR IEEE, P2625, DOI 10.1109/CVPR.2015.7298878
[9]
Learning Spatiotemporal Features with 3D Convolutional Networks
[J].
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2015,
:4489-4497
[10]
Video-Based Emotion Recognition using CNN-RNN and C3D Hybrid Networks
[J].
ICMI'16: PROCEEDINGS OF THE 18TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION,
2016,
:445-450